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y ESTs FOR SECRETED PROTEINS EXPRESSED 
IN TESTIS AND OTHER TISSUES 
Background of the Invention 
The estimated 50,000-100,000 genes scattered along the human chromosomes offer 
S tremendous promise for the understanding, diagnosis, and treatment of human diseases. In 
addition, probes capable of specifically hybridizing to lod distributed throughout the human 
genome find applications in the construction of high resolution chromosome maps and in the 
identification of individuals. 

In the past, the characterization of even a ^gle human gene was a painstakii^ 
1 0 process, requiring years of effort. Recent developments in the areas of cloning vectors, DNA 
sequencing, and computer tedmology have merged to greatly accelerate the rate at which 
human genes can be isolated, sequenced, mapped, and characterized. Cloning vectors such as 
yeast artifidal chromosomes (YACs) and baaerial artificial chromosomes (BACs) are able to 
accept DNA inserts ranging fix)m 300 to 1000 kilobases (kb) or 100-400 kb in length 
1 5 respectively, thereby fedlitatii^ the manipulation and ordering of DNA sequences distributed 
over great distances on the human chromosomes. Automated DNA sequencing machines 
permit the rapid sequencing of human genes. Bioinformatics software enables the 
comparison of nucleic add and protein sequences, thereby assisting in the charaaerizadon of 
human gene products. 

20 Currently, two different approaches are being pursued for identifying and 

characterizing the genes distributed along the human genome. In one approach, large 
fi:agments of genomic DNA are isolated, doned, and sequenced Potential open reading 
fi^es in these genomic sequences are identified uang btoinfiarmatics software. However, 
this approach entails sequencii^ large stretches of human DNA which do not encode proteins 

25 in order to find die protein encoding sequences scattered throughout the genome. In addition 
to requiring extensive sequencing, die bioinformatics software may mischaracterize the 
genomic sequences obtained. Thus, the software may produce &lse positives in which non- 
coding DNA is mischaracterized as coding DNA or 61se natives in which coding DNA is 
mislabeled as non-coding DNA 

30 An alternative approach takes a more direct route to identifying and characterizing 

human genes. In this approach, complementary DNAs (cDNAs) are synthesized fi-om 
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isolated messenger RNAs (mRNAs) which encode human protans. Using this approach, 
sequencing is only perfonned on DNA which is derived from protein coding portions of the 
genome. Often, only short str^ches of the cDNAs are sequenced to obtain sequences called 
expressed sequence tags (ESTs). The ESTs may then be used to isolate or purify extended 
S cDNAs which include sequences adjacent to the EST sequences. The extended cDNAs may 
contain all of the sequence of the EST which was used to obtain them or only a ponion of the 
sequence of the EST which was used to obtain them. In addition, the extended cDNAs may 
contain the full coding sequence of the gene from which the EST was derived or, 
alternatively, the extended cDNAs may include portions of the coding sequence of the gene 

10 from which the EST was derived. It will be appreciated that there may be several extended 
cDNAs which include the EST sequence as a result of alternate splicing or the activity of 
akemative promoters. 

In die past, these short EST sequences were often obtained from oligo-dT primed 
cDNA libraries. Acoxdingly, they mainly corresponded to the 3' untranslated r^on of the 

IS mRNA. bi part, the prevalence of EST sequences derived from the 3' end of the mRNA is a 
result of the faa that typical techiuques for draining cDNAs are not well suited for isolating 
cDNA sequences derived fix)m the 5' ends of mRNAs. (Adams et aL. Ncuitre 377:3-174, 
1996; YmsxetaL, Genome Res. 6:807-828, 1996). 

In addition, in those reported instances where longer cDNA sequences have been 

20 . ^obtained, the reported sequences typically correspond to coding sequences and do not include 
the fiiB 5' untrandatcd region of the mRNA from which the cDNA is derived Such 
incomplete sequences may not include the first exon of the mRNA, particulariy in situations 
where the first exon is short. Furthermore, they may not include some exons, often short 
ones, which are located upstream of splicing sites. Thus, there is a need to obtain sequences 

25 derived fix)m die S' ends of mRNAs. 

While many sequences derived from human chromosomes have pracdcal applications, 
approaches based on the identification and characterization of those chromosomal sequences 
which encode a protein product are particularly relevant to diagnostic and therapeutic uses 
Of the 50,000-100,000 protein coding genes, those genes encoding proteins which arc 

30 secreted from the cell in which they are synthesized, as well as the secreted proteins 
themselves, are particularly valuable as potential therapeutic agents. Such proteins are of^en 
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involved in cell to cell communication and may be responsible for producing a clinically 
relevant response in their target ceils. 

In fact, several secretory proteins, including tissue plasminogen activator, G-CSF. 
GM-CSF, erythropoietin, human growth hormone, insulin, interferon-ot, interferon-P, 
S interferon-y, and imerieukin-2, are currently in clinical use. These proteins are used to treat a 
wide range of conditions, including acute myocardial infiirction, acute ischemic stroke, 
anemia, di2d)etes, growth hormone deficiency, hepatitis, kidney cardnoma, chemotherapy 
induced neutropenia and multiple sclerosis. For these reasons, extended cDNAs encoding 
secreted protdns or portions thereof represent a particularly valuable source of therapeutic 

10 agents. Thus, there is a need for the identification and characterization of secreted proteins 
and the nucldc adds encoding thent 

In addition to beiz^ therapeutically usefiil themselves, secretory proteins include short 
peptides, called signal peptides, at didr amino termini which direct thdr secretion. These 
signal peptides are encoded by the signal sequences kxated at the 5\ ends of the coding 

15 sequences of genes encoding secreted proteins. Because these signal peptides will direa the 
extracellular secretion of any protein to which they are operably linked, the signal sequences 
may be exploited to direa the eflBcient secretion of any protein by operably linking the signal 
sequences to a gene encoding the protein for which secretion is desired. In addition, pordons 
of signal sequences nuiy also be used to direa the intracdlular import of a peptide or protein 

20 of interest. This may prove benefidal in gene therapy stratcges in which k is desired to 
deliver a paiticular gene produa to ceUs other than the cdi in which it is produced. Signal 
sequences encoding signal peptides also find application in simplifying protdn purificauon 
techniques. In such q)plications, the extraceUular secretion of the deared protein greatly 
facilitates purification by reducing the number of undesired proteins fi-om which the desired 

25 protein must be sdeaed. Thus, there exists a need to identify and charaaerize the 5' pon:ons 
of the genes for secretory proteins which encode signal peptides. 

Public information on the number of human genes for which the promoters ariJ 
upstream regulatory regions have been identified and charaaerized is quite limited. In ov\ 
this may be due to the difiBcuIty of isolating such r^ulatory sequences. Upstream regulatoA 

30 sequences such as transcription feaor binding sites are typically too short to be utilized a> 
probes for isolating promoters from human genomic libraries. Recently, some approachcN 
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have been devetoped to isolate human promoters. One of them consists of making a CpG 
island library (Cross, et ai.. Nature Genetics 6: 236-244, 1994). The second consists of 
isolating human genomic DNA sequences containing Spel binding sites by the use of Spel 
binding protein. (Mortloclc et oL. Genome Res. 6:327-335. 1996). Both of these approaches 

5 have their limits due to a lack of specificity or of comprehensiveness. 

The present 5' ESTs may be used to effidemly identify and isolate upsueam 
r^ulatoiy regions which control the location, developmental st^e, rate, and quantity of 
protein synthesis, as weU as the stability of the mRNA. (Theil, BioFaaors 4:87-93. 1993). 
Once identified and characterized, these regutetory regions may be utilized in gene therapy or 

10 protein purification schemes to obtain the desired amount and locations of protein synthesis 
or to inhiWt, reduce, or prevent die synthesis of undesirable ^ne produas. 

In addition, ESTs containing the 5' ends of seaetoty protein genes may include 
sequences usefiil as probes for chromosome mapping and the identification of indwiduals. 
Thus, there is a need to identify and characterire the sequences upstream of the 5' coding 

1 5 sequences of genes encoding secretory protons. 

Summary of the Invention 
The present invention relates to purified, isolated, or recombinant ESTs which include 
sequences derived fix)m the audientic 5' ends of their corresponding mRNAs. The term 

20 . "corresponding mRNA" refers to the mRNA which was the template for the cDNA synthesis 
which produced the 5' EST. These sequences wiU be refiared to hereinafter as "5' ESTs." 
As used herein, the tenn "purified" does not require absolute purity; rather, it is intended as a 
relative definitioa Individual 5' EST clones isolated fiora a cDNA library have been 
conventionally purified to electrophoredc homogeneity. The sequences obtained from these 

25 clones could not be obtained directly either from the Kbrary or fix)m total human DNA The 
cDNA clones are not naturally occurring as such, but rather are obtained via manipulation of 
a partially purified naturally occurring substance (messenger UNA). The conversion of 
mRNA into a cDNA Ubrary involves the creation of a synthetic substance (cDNA) and pure 
individual cDNA clones can be isolated from the synthetic library by clonal selection. Thus. 
30 creating a cDNA Ubraiy from messenger RNA and subsequently isolating individual clones 
from that library results in an approximately 10^-10* fold purification of die native message. 
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Purification Gff starting material or natural material to at least one order of magnitude, 
preferably two or three orders, and more preferably four or five orders of magnitude is 
expressly contemplated. 

As used herein, the term "isolated" requires that the material be removed fi-om its 
5 original environment (eg., the natural environment if it is naturally occurring). For example, 
a naturally-occurring polynucleotide present in a living animal is not isolated, but the same 
polynucleotide, separated fix)m some or all of the coexisting materials in the natural system, is 
isolated. 

As used herein, the term "recombinant'' means that the 5' EST is adjacent to 
10 "backbone" nucleic acid to which it is not adjacent in its natural environment. Additionally, to 
be "enriched" the 5' ESTs will represent 5% or more of the number of nucleic acid inserts in a 
population of nucleic add backbone molecules. Backbone molecules according to the 
present invention include nucleic adds such as expression veaors, seif-replicating nucleic 
adds, viruses, integrating nuddc adds^ and other vectors or nudeic adds used to maintain or 
IS manipulate a nuddc add insert of interest. Preferably, the enriched S' ESTs represent 15% 
or more of the number of nuddc add inserts in the population of recombinant backbone 
molecules. More preferably, the enriched 5' ESTs represent S0% or more of the number of 
nucleic add inserts in the population of recombinant backbone molecules. In a highly 
preferred embodiment, the enriched S' ESTs represent 90% or more of the number of nucldc 
20 . add inseits in the population of recombinant backbone molecules. 

''Stringent", moderate," and "low" hybridization conditions are as defined in Example 

29. 

Unless otherwise indicated, a "complementary" sequence is fiiUy complementary. 
Thus, 5' ESTs in cDNA libraries in which one or more 5' ESTs make up 5% or more 
25 of the number of nucleic acid inserts in the backbone molecules are "enriched recombinant S' 
ESTs" as defined herda Likewise, 5' ESTs in a population of plasmids in wluch one or more 
5' EST of the present invention have been inserted such that they represent 5% or more of the 
number of inserts in the plasmid backbone are " enriched recombinant 5' ESTs*' as defined 
herein. However, 5' ESTs in cDNA libraries in which S' ESTs constitute less than S% of the 
30 number of nuddc acid inserts in the population of backbone molecules, such as libraries in 
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which backbone molecules having a S' EST insert are extremely rare, are not "enriched 
recombinant 5' ESTs". 

In particular, the present invention rdates to 5' ESTs which are derived from genes 
encoding secreted proteins. As used herein, a ''secreted*' proteiii is one \^ch. when 
S expressed in a suitable host ceil, is transported across or through a membrane, including 
iran^rt as a result of signal peptides in its amino acid sequence. "Secreted" proteins include 
without limitation proteins secreted wholly (e.g. soluble proteins), or partially (e.g. receptors) 
from the cell in which they are expressed. ''Secreted" proteins also include without limitation 
proteins ^^ch are transported across the membrane of the endoplasmic reticulum. 

10 Such 5' ESTs inchide nucleic acid sequences, called signal sequences, which encode 

signal peptides which direct the extracellular secretion of the proteins encoded by the genes 
from which the 5' ESTs are derived. Generally, the signal peptides are located at the amino 
termini of secreted proteins. 

Secreted proteins are translated by ribosomes assodated with the "rough" 

IS endoplasmic redcuhim. Generally, secreted proteins are co-transtadonally transfored to the 
membrane of the endoplasmic redculum Association of the ribosome with the endoplasmic 
reticukun during translation of secreted proteins is mediated by the signal peptide. The signal 
peptide is typicaDy cleaved following h:s co-translational entry into the endoplasmic reticulum. 
After delivery to the endoplasmic reticulum, secreted proteins may proceed through the 

20 • Golgi apparatus. In the Golgi apparatus, the proteins may undergo post-translational 
modification before entering secretory vesicles which transport them across the cell 
membrane. 

The 5' ESTs of the present invention have several important applications. For 
example, they may be used to obtain and express cDNA clones which mclude the foil protein 

25 coding sequences of the corresponding gene products, including the authentic translation start 
sites derived from the 5' ends of the coding sequences of the mRNAs from which the 5' ESTs 
are derived. These cDNAs will be referred to hereinafter as "full length cDNAs," These 
cDNAs may also indude DNA derived from mRNA sequences upstream of the translation 
start site. The full length cDNA sequences may be used to express the proteins 

30 corresponding to the 5' ESTs. As discussed above, secreted proteins are therapeutically 
important. Thus, the proteins expressed from the cDNAs may be useful in u-eating or 
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controlling a variety of human conditions. The 5' ESTs may also be used to obtain the 
corresponding genomic DNA. The term "corresponding genomic DNA" refers to the 
genomic DNA which encodes the mRNA from which the 5' EST was derived. 

Akematively, the 5' ESTs may be used to obtain and express extended cDN«As 
5 encoding portions of the secreted protein. The portions may comprise the signal peptides of 
the secreted proteins or the mature proteins generated when the signal peptide is cleaved off 
The portions may also comprise polypeptides having at least 10 consecutive amino acids 
encoded by the extended cDNAs or full length cDNAs. Alternatively, the portions may 
comprise at least IS consecutive amino acids encoded by the extended cDNAs or full length 
10 cDNAs. In some embodiments, the portions may comprise at least 25 consecutive amino 
acids encoded by the extended cDNAs or full length cDNAs. In other embodiments, the 
portions may comprise at least 40 amino acids encoded by the extended cDNAs or fiill ler^h 
cDNAs. 

Antibodies wUdi spedfically recognize the entire secreted proteins encoded by the 
IS extended cDNAs, full length cDNAs, or fragments thereof having at least 10 consecutive 
amino adds, at least IS consecutive amino adds, at least 25 consecutive amino adds, or at 
least 40 consecutive ammo adds may also be obtained as described below. Antibodies which 
specifically recognize the mature protein generated when the signal peptide is cleaved may 
also be obtained as described bdow. Similarly, antibodies which specifically recognize the 
20 . signal peptides encoded by the extended cDNAs or full length cDNAs may also be obtained. 

In some embodiments, the extended cDNAs obtained using the 5' ESTs include the 
signal sequence. In other embodiments, the extended cDNAs obtained using the S' ESTs may 
indude the fiiU coding sequence for the mature proton {Le. the protdn generated when the 
signal polypeptide is cleaved ofif). In addition, the extended cDNAs obtained using the S' 
25 ESTs may include regulatory regions upstream of the translation start site or downstream of 
the stop codon which control the amount, location, or developmental stage of gene 
expressioa 

As discussed above, secreted proteins are therapeutically important. Thus, the 
proteins expressed from the extended cDNAs or fiill length cDNAs obtained using the 5' 
30 ESTs may be usefijl in treating or conu-olling a variety of human conditions. 
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The 5' ESTs (or cDNAs or genomic DNAs obtained therefrom) may be used in 
forensic procedures to identify individuals or in diagnostic procedures to identify individuals 
having genetic diseases resultii^ from abnonnal expression of the genes corresponding to the 
5' ESTs. In addition, the present invention is use&l for construaing a high resolution map of 
5 the human chromosomes. 

The present invention also rdates to secredon vectors capable of directing the 
secretion of a proteta of interest Such vectors may be used m gene therapy strategies in 
which it is desired to produce a gene product in one cell which is to be delivered to another 
location in the body. Secretion vectors may also fidlitate the purification of desired proteins 

10 The present invention also relates to expression vectors capable of directing the 

expression of an inserted gene in a desired spatial or temporal manner or at a desired level. 
Such vectors may inchide sequences upstream of the 5' ESTs, such as promoters or upstream 
regulatory sequences. 

Fmally, the present invention may also be used for gene the^^^)y to control or treat 

15 genetic diseases. Signal peptides may also be fused to heterologous proteins to direct their 
extracellular secretion. 

Bacterial clones containing Bluescript plasmids having inserts containing the S' ESTs 
of the present invendon (SEQ ID NOs: 38-270 arc presently stored at 80X in 4% (v/v) 
glycerol in the inventor's laboratories under the designations listed next to the SEQ ID NOs in 

20 . n). The inserts may be recovered from the deposited materials by growing the appropri^^^ 
clones on a suitable medium. The Bluescript DNA can then be isolated using plasmid 
isoladon procedures fimniliar to those skilled in the art such as alkaline lysis minipreps or large 
scale alkaBne lysis plasmid isolation procedures. If desired the plasmid DNA may be further 
enriched by centrifiigation on a cesium chloride gradient, size exclusion chromatography, or 

25 anion exchange chromatography. The plasmid DNA obtained using these procedures mav 
then be manipulated using standard cloning techniques 6miliar to those skilled in the an 
Alternatively, a PCR can be done with primers designed at both ends of the EST insertior. 
The PCR produa which corresponds to the 5' EST can then be manipulated using stanCa; J 
cloning techniques fruniliar to those skilled in the art. 
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One aspect of the present invention is a purified or isolated nucleic acid having the 
sequence of one of SEQ ID NOs: 38-270 or having a sequence complementary thereto. In 
one embodiment, the nucleic add is recombinant. 

Another aspect of the present invention is a purified or isolated nucleic acid 
5 comprising at least 10 consecutive bases of the sequence of one of SEQ ID NOs: 38-270 or 
one of the sequences complementary thereto. 

Yet another aspect of the present invention is a purified or isolated nucleic acid 
comprising at least 1 5 consecutive bases of one of the sequences of SEQ ID NOs: 38-270 or 
one of the sequences complementary thereto. In one embodiment, the nucleic acid is 
10 recombinant. 

A fiirther aspea of the present invention is a purified or isolated nucleic acid of at 
least 15 bases capable of hybridizing under stringent conditions to the sequence of one of 
SEQ ID NOs: 38-270 or one of the sequences complementary to the sequences of SEQ ID 
NOs: 38-270. In one embodiment, the nucleic add is recombinant. 
15 Another aspect of the present invention is a purified or isolated nuddc add encoding 

a human gene product, said human gene product having a sequence parually encoded by one 
of the sequences of SEQ ID NO: 38-270. 

Still another aspect of the present invention is a method of making a cDN A encoding 
a human secretory protein, said human secretory protdn being partially encoded by one of 
20 SEQ ID NOs 38-270, comprising the steps of contacting a collection of mRNA molecules 
from human ceUs with a primer comprising at least 15 consecutive nucleotides of a sequence 
complementary to one of SEQ ID NOs: 38-270; hybridizing said primer to an mRNA in said 
collection tiiat encodes said protdn; reverse transcribing said hybridized primer to make a first 
cDNA strand from said mRNA; making a second cDNA strand complementary to said first 
25 cDNA strand; and isolating the resulting cDNA encoding said protein comprising said first 
cDNA strand and said second cDNA su^d. 

Another aspea of tiie invention is an isolated or purified cDNA encoding a human 
secretoiy protein, said human secretory protein comprising the protein encoded by one of 
SEQ ID NOs 38-270 or a fr^enl thereof of at least 10 amino adds, said cDNA being 
30 obtainable by the method described in the preceding paragraph. In one embodiment, the 
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cDNA comprises the fiiQ protein coding sequence of said protein which sequence is partially 
included in one of the sequences of SEQ ID NOs: 38-270. 

Another aspect of the present invention is a method of making a cDNA encoding a 
human secretory protein that is partially encoded by one of SEQ ID NOs 38-270, comprising 

5 the steps of obtairui^ a cDNA comprising one of the sequences of SEQ ID NOs: 38-270; 
contacting said cDNA with a detectable probe comprising at least IS consecutive nucleotides 
of said sequence of SEQ ID NO: 38-270 or a sequence complementary thereto under 
conditions which permit said probe to hybridize to said cDNA; identi^^ a cDNA which 
hybricfizes to said detectable probe; and isolating said cDNA which hybridizes to said probe. 

10 Another aspect of the present invention is an isolated or purified cDNA encoding a 

human secretory protein, said human secretory protein comprising the protein encoded by 
one of SEQ ID NOs 38-270 or a fiagment thereof of at least 10 amino acids, said cDNA 
being ofataiixable by the method described in the preceding paragraph. In one embodiment, 
the cDNA comprises the fiill protein coding sequence partially included in one of the 

1 5 sequences of SEQ ID NOs: 38-270. 

Another aspect of the present invention is a method of making a cDNA comprising 
one of the sequence of SEQ ID NOs: 38-270, comprising the steps of contacting a collection 
of mRNA molecules torn human celts with a first primer capable of hybridizing to the poly A 
tail of s^d nANA; hybridizing said first primer to said polyA tail; reverse transcribing said 

20 mRNA to make a first cDNA strand; making a second cDNA strand complementary to said 
first cDNA strand usii^ at least one primer comprising at least IS nucleotides of one of the 
sequences of SEQ ID NOs 38-270; and isolating the resulting cDNA comprising said first 
cDNA strand and said second cDNA strand. 

Another aspect of the present invention is an isolated or purified cDNA encodir^ a 

25 human secretory protein, said human secretory protein comprising the protdn encoded by 
one of SEQ ID NOs 38-270 or a fragment thereof of at least 10 amino acids, said cDNA 
being obtainable by the method described in the preceding paragraph. In one embodiment, 
the cDNA comprises the fiiU protein coding sequence partially included in one of the 
sequences of SEQ ED NOs: 38-270. 

30 In one embodiment of the method described in the two paragraphs above, the second 

cDNA strand is made by contacting said first cDNA strand with a first pair of primers, said 



wo 99/06549 



11 



PCT/1B98/01231 



first pair of primers comprising a second primer comprisir^ at least IS consecutive 
nucleotides of one of the sequences of SEQ ID NOs 3S-270 and a third primer having a 
sequence therein which is included within the sequence of said first primer; performing a first 
polymerase chain reaction with said first pair of nested primers to generate a first PCR 

5 product; contacting said first PCR product with a second pair of primers, said second- pair of 
primers comprising a fourth primer, said fourth primer comprising at least 15 consecutive 
nucleotides of said sequence of one of SEQ ID NOs: 38-270 , and a fifth primer, said fourth 
and fifth primers being capable of hybridizing to sequences within said first PCR product; and 
performing a second polymerase chain reaction, thereby generating a second PCR product. 

10 One aspect of the present invention is an isolated or purified cDNA encoding a 

human secretory protein, said human secretory protein comprising the protein encoded by 
one of SEQ ID NOs 38*270, or a fragment thereof of at least 10 amino adds, said cDNA 
bong obtainable by the method of the preceding paragraph. In one embodiment, the cDNA 
comprises the fiill protein cocfiz^ sequence partially included in one of the sequences of SEQ 

15 ID NOs: 38-270. 

Another aspect of the present invention is the method described four paragraphs 
above in which the second cDNA strand is made by contacting said first cDNA strand with a 
second primer comprising at least 15 consecutive nucleotides of the sequences of SEQ ID 
NOs: 38-270; hybridizing said second primer to said first strand cDNA; and extending said 

20 hybridized second primer to generate said second cDNA strand. 

Another aspect of the present invention is an isolated or purified cDNA encoding a 
human secrctoiy protein, said human secretory protein comprising the protein partially 
encoded by one of SEQ ID NOs 38-270 or comprising a fragment thereof of at least 10 
amino acids, said cDNA being obtainable by the m^od described in the preceding 

25 paragraph. In. one embodiment, the cDNA comprises the fiiU protein coding sequence 
partially included in of one of the sequences of SEQ ID NOs: 38-270. 

Another aspect of the present invention is a method of making a protein comprising 
one of the sequences of SEQ ID NOs: 271-503, comprising the steps of obtaining a cDNA 
encoding the fiiU protein sequence partially included in one of the sequences of sequence of 

30 SEQ ID NOs: 38-270; inserting said cDNA in an expression vector such that said cDNA is 
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operably linked to a promoter, introducing said expression vector into a host cell whereby 
said host cell produces the protein encoded by said cDNA; and isolating said protein. 

Another aspect of the present invention is an isolated protein obtainable by the 
method described in the preceding paragraph. 
S Another aspect of the present invention is a method of obtaining a promoter DNA 

comprising the steps of obtaining DNAs located upstream of the nucleic adds of SEQ ID 
NOs: 38-270 or the sequences complementary thereto; screening said upstream DNAs to 
identify a promoter capable of directing transcription initiation; and isolating said DNA 
comprising said identified promoter. In one embodiment, the obtainii^ step comprises 

10 chromosome walking firdm said nucleic acids of SEQ ID NOs; 38-270 or sequences 
complementary thereto. In another embodiment, the screening step comprises insening said 
upstream sequences into a promoter reporter vector. In another embodiment, the screening 
step comprises identifying moti& in said upstream DNAs which are transcripdon factor 
binding sites or transcription start sites. 

15 Another aspect of the present invention is an isolated promoter obtainable by the 

method described above. 

Another aspect of the present invention is an isolated or purified protein comprising 
one of the sequences of SEQ ID NOs: 271-503. 

Another aspect of the present invention is the inclusion of at least one of the 

20 sequences of SEQ ID NOs: 38-270, or one of the sequences complementary to the sequences 
of SEQ ED NOs: 38-270, or a fragment thereof of at least 15 consecutive nucleotides in an 
msy of disaete ESTs or firagments thereof of at least IS nucleotides in lei^. In one 
embodiment, the array inchides at least two of the sequences of SEQ ID NOs: 38-270, the 
sequences complementary to the sequences of SEQ ID NOs: 38-270, or Segments thereof of 

23 at least IS consecutive nucleotides. In another embodiment, the array includes at least five of 
the sequences of SEQ ID NOs: 38-270, the sequences complementary to the sequences of 
SEQ ID NOs: 38-270, or fiagments thereof of at least 15 consecutive nucleotides. 

Another aspect of the present invention is a promoter having a sequence selected 
from the group consisting of SEQ ID NOs 3 1 . 34, and 37. 
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Brief Description of the Dtuwings 
Figure 1 is a summary of a procedure for obtaining cDNAs which have been 
selected to include tlic 5' ends of the niRNAs from which they derived. 

Figure 2 shows the distribution of Von Heijne scores for 5' ESTs in each of the 
5 categories described herein and the probability that these 5' ESTs encode a signal peptide. 

Figure 3 summarizes a general method used to clone and sequence extended cDNAs 
containing sequences adjacent to 5' ESTs. 

Figure 4 (description of promoters structure isolated from Signaffag 5* ESTs) 
provides a schematic description of promoters isolated and the way they are assembled with 
1 0 the corresponding 5' tags. 

Detailed Descrintion of the Preferred Embodiment 
Table IV is an analysis of the 43 amino adds located at the N terminus of all human 
SwissProt proteins to determine the frequency of false positives and false negatives using the 
techniques for signal peptide identification described herein. 
I S Table V shows the distribution of S' ESTs in each category described herein and the 

number of 5' ESTs in each category having a given minimum Von Heijne's score. 

Table VI shows the distribution of 5' ESTs in each category described herein with 
respect to the tissue from which the 5* ESTs of the corresponding mRNA were obtained. 

Table VII describes the transcription factor binding sites present in each of these 
20 promoters. 

L Gcncml Methods for Obtaining ESTs derived from mRNAs with intnct 
ends 

In order to obtain the S* ESTs of the present invention, mRNAs with intact S' 
2S ends must be obtained. Currently, there are two approaches for obtaining such mRNAs 
with intact S* ends as described below: either chemical (1) or enzymatic (2). 

1 Chemical Methods for Obtaining mRNAs having Intact Ends 

One of these approaches is a chemical modification method involving derivatization 
30 of the 5' ends of the mRNAs and selection of the derivatized mRNAs. The 5' ends of 
eukaiyotic mRNAs possess a structure referred to as a **cap" which comprises a guanosine 
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methylated at the 7 position. The cap is joined to the first transcribed base of the mRNA by a 
S\ 3 '-triphosphate bond. In some instances, the 5' guanosine is methylated in both the 2 and 
7 positions. Rarely, the 5' guanosine is trimethylated at the 2, 7 and 7 positions. In the 
chemical method for obtaining mRNAs having intact S' ends, the 5' cap is specifically 
S derivatized and coupled to a reactive group on an immobilizing substrate This specific 
derivatization is based on the fiict that only the ribose linked to the methylated guanosine at 
the S' end of the mRNA and the ribose linked to the base at the 3' terminus of the mRNA, 
possess 2\ 3'-cis diols. 

Optionally, the 2', 3'-cis diol of the 3' terminal ribose may be chemically 

10 modified, substituted, converted, or eliminated, leaving only the ribose linked to the 
methylated guanosine at the S' end of the mRNA with a 2\ 3'-cis dioL A variety of 
techniques are available for eliminating the 2\ 3'-cis diol on the 3' terminal ribose. For 
example, controlled alkaline hydrolysis may be used to generate mRNA firagments in 
vAich the 3' terminal ribose is a 3'-phosphate, 2'-phosphate or {2\ 3')-cyclophosphate. 

1 S Thereafter, the fragment which includes the original 3' ribose may be eliminated Scorn the 
mbcture through chromatography on an oligodT colunm. Alternatively, a base which 
lacks the 2\ 3'-cis diol may be added to the 3' end of the mRNA using an RNA ligase 
such as T4 RNA ligase. Example 1 below describes a method for ligation of a 
nucleoside diphosphate to the 3' end of messenger RNA. 

20 - 

EXAMPLE 1 

Ligation of the Nucleoside Diphosphate pCp to the 3' End of mRNA. 

One ^g of RNA was incubated in a final reacdon medium of 10 ^1 in the 
presence of S U of T4 phage RNA ligase in the buffer provided by the manufacturer (Gibco - 
25 BRL), 40 U of the RNase inhibitor RNasin (Promega) and, 2 jil of ^pCp (Amersham «PB 
10208). The incubation was perfomraed at 37*C for 2 hours or overnight at 7-8*^0. 



Following modification or elimination of the 2\ 3'-cis diol at the 3' ribose, the 2'. 
3'-cis diol present at the 5' end of the mRNA may be oxidized using reagents such as 
30 NaBH^, NaBH,CN, or sodium periodate, thereby converting the 2', 3'-cis diol to a 
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dialdehyde. Example 2 describes the oxidation of the 2\ 3'-cis dtol at the 5' end of the 
mRNA with sodium periodate. 

EXAMPLE 2 

5 Oxidation of 2'. 3'-cis diol at the 5' End of the mRNA with Sodium Periodate 

O.I OD unit of either a capped oligoribonucleotide of 47 nucleotides (including the 
cap) or an uncapped oligoribonucleotide of 46 nucleotides were treated as follows. The 
oligoribonucleotides were produced by in vitro transcription using the transcription kit 
"AmpliScribe T7" (Epicentre Technologies). As indicated below, the DNA template for the 

1 0 RNA transcript contained a single cytosine. To synthesize the uncapped RNA, all four NTPs 
were included in the in vitro transcription reaction. To obtain the capped RNA, GTP was 
replaced by an analogue of the cap« m7G(S')ppp(S'}G. This compound, recognized by the 
po^erase, was incorporated into the S' end of the nascent transcript during the initiation of 
transcription but was not incorporated during the extension step. Consequently, the resulting 

1 S RNA contained a cap at. its S' end. The sequences of the oligoribonucleotides produced by 
the in vitro transcription reaction were: 
+Cap: 

S'mTGpppGCAUCCUACUCCCAUCCAAUUCCACCCUAACUCCUCCCAUCUCCAC- 
3' (SEQIDNO:!) 
20 . -Cap: 

5'-pppGCAUCCUACUCCCAUCCAAUUCCACCCUAACUCCUCCCAUCUCCAC-3' 
(SEQIDN0:2) 

The oligoribonucleotides were dissoh/ed m 9 (il of acetate buffer (0.1 M so(Uum 
acetate, pH 5.2) and 3 (xl of freshly prepared 0. 1 M sodium periodate solution. The mbcmre 
25 was incubated for 1 hour in the dark at 4®C or room temperature. Thereafter, the reaction 
was stopped by adding 4 ^ of 10% ethylene glycol. The product was ethanol predfritated, 
resuspended in at least 10 ^1 of water or appropriate buffer and dialyzed against water. 

The resulting aldehyde groups may then be coupled to molecules having a reactive 
30 amine group, such as hydrazine, carbazide, ihiocarbazide or semicari)azide groups, in order to 
fadlitate enrichment of the 5' ends of the mRNAs. Molecules having reaaive amine groups 
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which are suitable for use in selecting mRNAs having intact 5' ends include avidin, proteins, 
antibodies, vitamins, ligands capable of specifically binding to receptor molecules, or 
oligonucleotides. Example 3 bdow describes the coupling of the resulting dialdetiyde to 
biotin. 

5 

EXAMPLES 

Coupling of the Dialdehvde at the 5' End of Transcript s with Biotin 
The oxidation product obtained in Example 2 was dissolved in SO of sodium 
acetate at a pH between 5 and 5.2 and 50 \il of fi-eshly prepared 0.02 M solution of biotin 
10 hydrazide in a methoxytthanol/water mixture (1:1) of formula: 



H 



NH 2— NH -C— (CH2)n— NH "C— (CHa)^ ' ' ^ NH 

In the compound used in these experiments, n=5. However, it will be appreciated that 
other commercially available hydrazides may also be used, such as molecules of the above 
formula in which n varies firom 0 to 5. The mwture was llien incubated for 2 hours at 37*C, 
1 5 precipitated with ethanol and diaiyzed against distilled water. Example 4 demonstrates the 
spedfidty of the biotinylation reactioa 



EXAiMPLE4 

Soecifidtv of Biorin^ation of Capped Transcripts 
20 The ^)ecificity of the biotiiqdation for capped mRNAs was evaluated by gel 

electrophoresis of the following samples: 

Sample 1. The 46 nucleotide uncapped in vitro transcript prepared as in Example 2 
and labeled with ^^pCp as described in Example 1 . 

Sample 2. The 46 nucleotide uncapped in vitro transcript prepared as in Example 2, 
25 labeled with '^pCp as described in Example 1, treated with the oxidation reaaion of Example 
2, and subjeaed to the biotinylation conditions of Example 3. 
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Sample 3. The 47 nucleotide capped /// vitro transcript prepared as in Example 2 and 
labeled with ^pCp as described in Example I. 

Sample 4. The 47 nucleotide capped in vitro transcript prepared as in Example 2, 
labeled with ^^pCp as described in Example 1, treated with the oxidation reaction of Example 
S 2, and subgected to the biotinylation conditions of Example 3. 

Samples 1 and 2 had identical migration rates, demonstrating that the uncapped 
RNAs were not oxidized and biotinylated. Sample 3 migrated more slowly than Samples 1 
and 2» while Sample 4 exhibited the slowest migration. The difference in migration of the 
RNAs in Samples 3 and 4 demonstrates that the capped RNAs were specifically biotinylated 

10 

In some cases, mRNAs having intaa 5' ends may be enriched by binding the molecule 
containing a reactive amine group to a suitable solid phase substrate such as the inside of the 
vessel containing the mRNAs, magnetic beads, chromatography matrices, or nylon or 
nitrocellulose menibranes. For example, where the molecule having a reacdve amine group is 

IS biotin, the solid phase substrate may be coupled to avidin or streptavidin. Alternatively, 
where the molecule having the reactive amine group is an antibody or receptor ligand, the 
solid phase substrate may be coupled to the cognate ant^en or receptor. Finally, where the 
molecule having a reactive amine group comprises an oligonucleotide, the solid phase 
substrate may comprise a complementary oligonucleotide. 

20 The mRNAs having intact 5' ends may be released from the solid phase following the 

enrichment procedure. For example, where the dialdehydc is coupled to biotin hydrazidc and 
the solid phase comprises streptavidin, the mRNAs may be released from the solid phase by 
simply heating to 95 d^ees Celsius in 2% SDS. In some methods, the molecule having a 
reactive amine group may also be cleaved from the mRNAs having intact 5' ends following 

25 enrichment. Example 5 describes the capture of biotinylated mRNAs with streptavidin coated 
beads and the release of the biotinylated mRNAs from the beads following enrichment. 

EXAMPLES 

Capture and Release of Biotinvlated mRNAs Using Streptavidin Coated Beads 
30 The streptavidin coated magnetic beads were prepared according to the 

manufacturer's instructions (CPG Inc., USA) The biotinylated mRNAs were added :o i 
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hybridization buffer (1.5 M NaCl, pH 5 - 6). After incubating for 30 minutes, the unbound 
and nonbiotinylated material was removed. The beads were then washed several times in 
water with 1% SDS. The beads thus obtained were incubated for 15 minutes at 95*'C in 
water containing 2% SDS. 
S Example 6 demonstrates the e£5ciency with which biotinylated mRNAs were 

recovered from the streptavidin coated beads. 

EXAMPLE 6 
EflSciencv of Recovenr of Biotinvlated mRNAs 
10 The efficiency of the recovery procedure was evaluated as follows. Capped RNAs 

were labeled with ^^pCp, oxidized, biotinylated and bound to streptavidin coated beads as 
described above. Subsequentiy, the bound RNAs were incubated for 5, 15 or 30. minutes at 
9S''C in the presence of 2% SDS. 

The products of the reaction were analyzed by electrophoresis on 12% 
IS polyacrylamide gels under denaturing conditions (7 M urea). The gels were subjected to 
autoradiograplqr. During this manipulation, the hydrazone bonds were not reduced. 

Increaang amounts of nucleic acids were recovered as incubation times in 2% SDS 
inaeased, demonstrating tiiat biotinylated mRNAs were efficiendy recovered. 

20 In an akemative n»thod for obtaining mRNAs having intact S' ends, an 

oligonucleotide which has been derivatized to contain a reactive amine group is specifically 
coupled to mRNAs having an intact cap. Preferably, the 3' end of the mRNA is blocked prior 
to the step in which the aldehyde groups are joined to the derivatized oligonucleotide, as 
described above, so as to prevent the derivatized oligonucleotide from being joined to the 3* 

25 end of the mRNA. For example, pCp may be attached to the 3' end of die mRNA using 14 
RNA ligase as described in example I. However, as discussed above, blocking the 3' end of 
the mRNA is an optional step. Derivatized oligonucleotides may be prepared as described m 
Example 7. 
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EXAMPLE 7 

p^rivatization of Oligonucleotides 
An oligonucleotide phosphorylated at its 3' end was converted to a 3' hydrazide in 3' 
by treatment with an aqueous solution of hydrazine or of dihydrazide of the formula 
5 H2N(R1)NH2 at about 1 to 3 M, and at pH 4.5 at a temperature of 8*0 overnight. This 
incubation was performed in the presence of a caibodiimide type agent soluble in water such 
as l-ethyl-3-(3-dimethylaminopropyl)caitodiiniide at a final concentration of 0.3 M. 

The derivatized oligonucleotide was then separated fi^om the other agents and 
products using a standard technique for isolating oligonucleotides. 
1 0 As discussed above, the mRNAs to be enriched may be treated to eliminate the 3' OH 

groups which may be present thereon. This may be accomplished by enzymatic ligation of 
sequences lacking a 3' OH, such as pCp, as described in Example 1 . Alternatively, the 3' OH 
groups may be eliminated by alkaline hydrolysb as described in Example 8 below. 

15 EXAMPLES 

FJimination of V OH Groups of mRNA Usine Alkaline Hydrolysis 
In a total volume of 100 of 0. 1 N sodium hydroxide. 1.5 ^g mRNA is incubated 
for 40 to 60 minutes at 4**C. The solution is neutralized with acetic arid and predpitated with 
ethanoL 

20 Following the optional elimination of the 3' OH groups, the diol groups at the 5' ends 

of the mRNAs are oxidized as described below in Example 9. 



EXAMPLE 9 

Oxidation of D tols of mRNA 

25 Up to 1 OD unit of RNA was dissolved in 9 mI of buflFer (0.1 M sodium acetate. pH 

6-7) or water and 3 ^1 of freshly prepared 0.1 M sodium periodate solution. The reaction 
was incubated for I h in the dark at 4X or room temperature. Following the incubation, the 
reaction was stopped by adding 4 of 10% ethylene glycol. Thereafter the mixture was 
incubated at room temperature for 15 minutes. After ethanoi precipitation, the product was 

30 resuspended in at least 10 ^1 of water or appropriate buflFer and dialyzed against water. 
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Following oxidation of the diol groups at the 5' ends of the mRNAs, the derivatized 
oligonucleotide was joined to the resulting aldehydes as described in Example 10. 

EXAMPLE 10 

5 Ligature of Ald ehydes of mRNA to Derivatized Oligonucleotides 

The oxidized mRNA was dissolved in an acidic medium such as 50 ^1 of sodium 
acetate pH 4-6. Fifty |il of a solution of the derivatized oligonucleotide were added in order 
to obtain an mRNA:derivatized oligonucleotide ratio of 1:20. The nuxture was reduced with 
a borohydride and incubated for 2 h at 37^C or overnight (14 h) at 10*C. The nuxture was 
10 then ethanol precipitated, resuspended in 10 nl or more of water or appropriate buflFer and 
cfialyzed against distilled water. If desired, the resulting product may be analyzed uang 
acrylamide gel electrophoresis, HPLC analysis, or other conventional techniques. 

Following the attachment of the derivatized oligonucleotide to the mRNAs. a 
1 5 reverse transcription reaction may be performed as described in Example 1 1 below. 

EXAMPLE 11 

Reverse Transcription of mRNAs Lisamred to Derivatized O ligonucleotides 
An oligodeoxyribonudeotide was derivatized as follows. Three OD units of an 
20 . oligodeoxyribonudeotide of sequence 5' ATCAAGAATTCCSCACGAGACCATTAS' (SEQ 
ID N0:3) having 5'-0H and 3'-P ends were dissolved in 70 jil of a 1.5 M 
hydroxybcnzotriazole solution, pH 5,3, prepared in dimethylforniamide/water (75:25) 
containing 2 ^g of l-ethyl-3<3-<taethylaminopropyl)carbodiiniide. The nuxture was 
incubated for 2 h 30 min at 22**C and then precipitated twice in LiClO^/aceione. The pellet 
25 was resuspended in 200 of 0.25 M hydrazine and incubated at 8*C from 3 to 14 h. 
Following the hydrazine reaction, the mixture was precipitated twice in LiClOVacetone. 

The messenger RNAs to be reverse transcribed were extraaed from blocks of 
placenta having sides of 2 cm which had been stored at -80^C. The total RNA was extracted 
using conventional acidic phenol techniques. Oligo-dT chromatography was used to purify 
30 the mRNAs. The integrity of the mRNAs was checked by Northem-blotting. 
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The diol groups on 7 ^ of the placental mRNAs were oxidized as described above in 
Example 9. The derivatized oligonucleotide was joined to the mRNAs as described in 
Example 10 above except that the precipitation step was replaced by an exclusion 
chromatography step to remove derivatized oligodeoxyribonudeotides v^ch were not joined 
3 to mRNAs. Exclusion dvomatograpl^ was performed as follows: 

Ten ml of Ultrogd AcA34 (BtoSepra#2301Sl) gel, a mix of agarose and acrylamide, 
were equilibrated in SO ml of a solution of 10 mM Tris pH 8.0, 300 mM NaCl 1 mM EDTA, 
and 0.05% SDS. The mbcture was allowed to sediment. The supernatant was eliminated and 
the gel was resuspended in SO ml of buffer. This procedure was repeated 2 or 3 times. 
10 A glass bead (diameter 3 mm) was introduced into a 2 ml disposable pipene (length 

25 cm). The pipette was filled with the gel suspension until the height of the gel stabilized at 
1 cm fi-om the top of the pipette. The column was then equilibrated with 20 ml of 
equilibration buffer ( 10 mM Tris HCl pH 7.4, 20 mM NaCl). 

Ten Ml of the mRNA which had reacted with the derivatized oligonucleotide were 
15 mixed in 39 m1 of 10 mM uiea and 2 \sl of blue-g}ycerol buffer, which had been prepared by 
dissohdng 5 mg of bromo{rfienoI blue in 60% glycerol (v/vX and passing the mixture through 
a 0.45 (mi (fiameter fiber. 

The column was then loaded with the mRNAs coupled to the oligonucleotide. As 
soon as the sample had penetrated, equilibration buffer was added. Ibmdred m1 fiacdons were 
20 . then collected. Derivatized oligonucleotide which had not been attached to mRNA appeared 
in fiaction 16 and later fiactions. Thus, firactions 3 to 15 were combined and precipitated 
withethanol. 

To detemiine whether the derivatized oligonucleotide was actually linked to mRNA, 
one tenth of the combined fractions were spotted twice on a nylon membrane and hybridized 

25 to a racfioacdve probe using conventional techniques. The ^^P labeled probe used in these 
hybridizations was an oligodeoxyribonucleodde of sequence 
5TAATGGTCTCGTGCGAATrCTTGAT3' (SEQ ID N0:4) amicomplementary to the 
derivatized oligonucleotide. A signal observed after autoradiography, indicated that the 
derivatized oligonucleotide had been taily joined to the mRNA. 

30 The remaining nine tenth of the mRNAs which had reacted with tiie derivatized 

oligonucleotide was reverse transcribed as follows. A reverse transcription reaction was 
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carried out with reverse trarscriptase following the manufacturer's instructions and 50 pmol 
of nonamers with random sequence as primers. 

To ensure that reverse transcription had been carried out through the cap structure, 
two types of experiments were performed. 

5 In the first approach, after elimination of RNA of the cDNA:RNA heteroduplexes 

obtained fiom the reverse transcription reaction by an alkaline hydrolysis, a portion of the 
resulting single stranded cDNAs was spotted on a positively charged membrane and 
hybridized, using conventional methods, to a ^^P labded probe having a sequence identical to 
that of the derivatized oligonucleotide. Control spots containing, 1 pmol, 100 finol, 50 finol, 

10 10 finol and I finol of a control oligodeoxyribonucleotide of sequence identical to that of the 
derivatized oligonucleotide were included. The signal observed in the spots containing the 
cDNA indicated that approximately 15 finol of the derivatized oligonucleotide had been 
reverse transcribed. These results demonstrate that the reverse transcription can be 
performed tiirough the cap and, in particular, thai reverse transcriptase crosses the 5'-P-P-P- 

15 5' bond of the cap of eukaiyotic messenger RN As. 

In the second type of experiment, the single stranded cDN As obtained from the above 
first strand syrthesis were used as template for PCR reactions. Two types of reaaions were 
carried out First, specific ampBfication of the mRNAs for alpha globin. dehydrogenase, ppl 5 
and dongation fector E4 were carried out usir^ the foflowir^ pairs of 

20 ofigodeoj^bonudeotide primers. 

alpha-globin 

GLO-S: 5'CCG ACA AGA CCA ACG TCA AGG CCG C3' (SEQ ID N0:5) 
GLO-As: 5TCA CCA GCA GGC ACT GGC TTA GGA G 3' (SEQ ID N0:6) 

25 

dehydrogenase 

3 DH-S: 5' AGT GAT TCC TGC TAG ITT GGA TGG C3' (SEQ ID N0:7) 
3 DH-As: 5'GCT TGG TCT TGT TCT GGA GTT TAG A3' (SEQ ID N0:8) 

30 ppl5 

PP15-S: 5'TCC AGA ATG GGA GAC AAG CCA ATT T3' (SEQ ID N0:9) 
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PPl 5-As: 5' AGO GAG GAG GAA ACA GCG TGA GTC C3' (SEQ ID NO: 10) 

Elongation &ctor E4 
EFAl-S: S'ATG GGA AAG GAA AAG ACT CAT ATC A3' (SEQ ID N0:1 1) 
5 EFl A-As: 5'AGC AGC AAC AAT CAG GAC AGC ACA G3' (SEQ ID N0:12) 



Second, non spedfic amplifications were also carried out with the antisense 
oligodeo^QTibonucieotides of the pairs described above and with a primer derived from the 
sequence of the derivatized oiigodeoxyribonudeotide 

10 (5'ATCAAGAATTCGCACGAGACCATTA3') (SEQ ID N0:13). 

One twentieth of the followii^ RT-PCR product samples were run on a 1.5% |^ 
agarose gel and stained with ethidium bromide. 

Sample 1: The products of a PGR reaction using the globin primers of SEQ ID NOs 
S and 6 in the presence of cDNA. 
1 5 Sanqple 2: The products of a PCR reaction using the globin primers of SEQ ID NOs 

3 and 6 in the absence of added cDNA. 

Sample 3: The products of a PCR reaction using the dehydrogenase primers of SEQ 
ED NOs 7 and 8 in the presence of cDNA 

Sample 4: The products of a PCR reaction using the dehydrogenase primers of SEQ 
20 ID NOs 7 and 8 in the absence of added cDNA. 

Sample 5: The products of a PCR reaction using the ppl 5 primers of SEQ ID NOs 0 
and 10 in the presence of cDNA. 

Sample 6: The products of a PCR reacdon using the pplS primers of SEQ ID NOs 9 
and 10 in the absence of added cDNA. 
25 Sample 7: The products of a PCR reaction using the EIF4 primers of SEQ ID NOs 

1 1 and 12 in the presence of added cDNA. 

Sample 8: The products of a POL reaction using the EIF4 primers of SEQ ID NOs 
11 and 1 2 in the absence of added cDN A. 

A band of the aze expected for the PCR produa was ob.served only in samples 1, ?v 
30 5 and 7, thus indicating the presence of the corresponding sequence in the cDNA population 



wo 99/06549 PCT/1B98/0I23I 

24 

PCR reactions were also carried out with the andsense oligonucleotides of the globin 
and dehydrogenase primers (SEQ ID NOs 6 and 8) and an oligonucleotide whose sequence 
corresponds to that of the derivatized oligonucleotide. The presence of PCR products of the 
expected size in the samples equivalent to above samples I and 3 indicated that the 
S derivatized oligonucleotide had been linked to mRNA. 

The above examples summarize the chemical procedure for enriching mRNAs for 
those having intact 5' ends as illustrated in Figure 1. Further detail regarding the chemical 
approaches for obtaining such mRNAs are disclosed in International Application No. 

10 W096/34981, publidied November 7, 1996, which is incorporated herein by reference. 
Strategies based on the above chemical modifications to the 5' cap structure may be utilized 
to generate cDNAs seleaed to include the S' ends of the mRNAs from which they derived. 
In one version of sudi procedures, the S' ends of the mRNAs are modified as described 
above. Thereafter, a reverse transcription reaction is conducted to extend a primer 

IS complementary to the 5' end of the mRNA. Sii^e stranded RNAs are eliminated to obtain a 
population of cDNA/mRNA heteroduplexes in which the mRNA includes an intaa S' end. 
The resulting heteroduplexes may be captured on a solid phase coated with a molecule 
capable of interacting with the molecule used to derivatize the 5' end of the mRNA. 
Thereafter, the strands of the heteroduplexes are separated to recover single su^ded first 

20 .cDNA strands which indude the 5' end of the mRNA. Second strand cDNA synthesis may 
then proceed using conventional techniques. For example, the procedures disclosed in WO 
96^4981 or in CamincL ei al.^ Genomics 37:327-336, 1996, the disclosures of which are 
incorporated herein by reference, may be employed to select cDNAs which include the 
sequence derived from the 5' end of the coding sequence of the mRNA. 

2S Following ligation of the oligonucleotide tag to the S' cap of the mRNA, a 

reverse transcription reaction is conducted to extend a primer complementary to the 
mRNA to the 5' end of the mRNA. Following elimination of the RNA component of the 
resulting heteroduplex using standard techniques, second strand cDNA synthesis is 
conducted with a primer complementary to the oligonucleotide tag. 
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2, Enzymatic* Methods for Obtaining mRNAs having Intact 5' Ends 

Other techniques for selecting cDNAs extending to the 5' aid of the niKSA from 
which they are derived are fully enzymatic. Some versions of these techniques are disclosed 
in Dumas Nfilne Edwards J.B. (Doctoral Thesis of Paris VI University, Le clonage des ADNc 
5 complets: difficukes et perspectives nouvetics. Apports pour Fetude de la regulation de 
fexpression de la tryptophane hydroxylase de rat, 20 Dec. 1993), EPO 625572 and Kaio et 
ai. Gene 150:243-250, 1994, the disclosures of which are incorporated herein by reference. 

Briefly, in such approaches, isolated mRNA is treated with alkaline phosphatase 
to remove the phosphate groups present on the 5' ends of uncapped incomplete mRNAs. 

10 Following this procedure, the cap present on full length mRNAs is enzymatically 
removed with a decapping enzyme such as T4 polynucleotide kinase or tobacco acid 
pyrophosphatase. An oligonucleotide, which may be either a DNA oligonucleotide or a 
DNA-RNA hybrid oligonucleotide having RNA at its 3' end. is then ligated to the 
phosphate present at the S' end of the decapped mRNA using T4 RNA ligase. The 

15 oligonucleotide nnay include a restriction site to facilitate cloning of the cDNAs following 
their synthesis. Example 12 below describes one enzymatic method based on the 
doctoral thesis of Dumas. 

EXAMPLE 12 

20 Enzvmatic Approach for Obtaining 5' ESTs 

Twenty micrograms of PolyA+ RNA were dephosphorylated using Calf Intestinal 
Phosphatase (Biolabs). Afler a phenol chlorofomi exu^on, the cap structure of mRNA 
was hydrolysed using the Tobacco Acid Pyrophosphatase (purified as described by Shinshi et 
aL, Biochemistry 15: 2185-2190, 1976) and a hemi 5T)NA/RNA-3' oligonucleotide having 

25 an unphosphorylated 5' end, a stretch of adenosine ribophosphate at the 3' end, and an EcoRI 
site near the 5' end was iigated to the 5'P ends of mRNA using the T4 RNA ligase (Biolabs). 
Oligonucleotides suitable for use in this procedure are preferably 30 to 50 bases in length. 
Oligonucleotides havii^ an unphosphorylated S' end may be synthesized by adding a 
fluorochrome at the 5' end. The inchision of a stretch of adenosine ribophosphates at the 3' 

30 end of the oligonucleotide increases ligation e£5ciency. It will be appreciated that the 
oligonucleotide may comain cloning sites other than EcoRI. 
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Following ligation of the oligonucleotide to the phosphate present at the 5' end of 
the decapped niRNA, first and second strand cDNA s>'nthesis is carried out using 
conventional methods or those specified in EPO 625,572 and Kato et ai sttpra, and 
Dumas ^4ilne Edwards, supra, the disclosures of which are incorporated herein by 
5 reference. The resulting cDNA may then be ligatcd into vectors such as those disclosed 
in Kato et aL^ supra or other nucleic acid vectors known to those skilled in the art using 
techniques such as those described in Sambrook et aL, Molecular Cloning: A Laboratory 
Manual 2d Ed., Cold Spring Harbor Laboratory Press, 1989, the disclosure of which is 
incorporated herein by reference. 

10 

IL Obtention and Characterization of the 5' ESTs of the Present Invention 

The 5' ESTs of the present invention were obtained using the aforementioned 
chemical and enzymatic approaches for enriching mRNAs for those having intact 5' ends 
as decribed below. 

15 

1 Obtention of ESTS Using mRNAs with Intact 5^ Ends 

First, mRNAs were prepared as described in Example 13 below. 

EXAMPLE 13 

20 Preparation of mRNA With Intaa 5' Ends 

Total human BNAs or polyA' RNAs derived from 29 different tissues were 

req)ecth^ely purchased fii»n LABIMO and CLONTECH and used to generate 44 cDNA 

Ebraries as follows. The purchased RNA had been isolated from cells or tissues using acid 

guanidium thiocyanate-phenol-chloroform extraction (Chomczyniski and Sacchi, Analytical 
25 Biochemistry 162:156-159, 1987). PolyA* RNA was isolated from total RNA (LABIMO) 

by two passes of oligo dT chromatography, as described by Aviv and Leder, Proc Natl. 

AcadL Set 'USA 69: 1408-1412, 1972 in order to eliminate ribosomal RNA. 

The quality and the integrity of the polyA-^ RNAs were checked. Northern blots 

hybridized with a globin probe were used to confirm that the mRNAs were not degraded. 
30 Contamination of the polyA"" mRNAs by ribosomal sequences was checked using Northern 

blots and a probe derived from the sequence of the 28S rRNA. Preparations of mRNAs with 
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less than 5% of rRNAs were used in library construction. To avoid constructing libraries 
with RNAs contaminated by exogenous sequences (prokaryotic or fiingal). the presence of 
bacterial i6S ribosomal sequences or of two hi^y expressed fungal mRNAs was examined 
using PCR. 

5 Following preparation of the mRNAs, the above described chemical and/or the 

enzymatic procedures for enriching mRNAs for thoses having intact 3' ends were employed 
to obtain 5' ESTs from various tissues. In both approaches, an oligonucleotide tag was 
attached to the S' ends of the mRNAs. The oligonucleotide tag had an EcoRI site therein to 
&cilitate later cloning procedures. To facilitate the processing of single stranded and double 

10 stranded cDNA obtained in the construction of the librairies, the same nucleotidic sequence 
was used to design the ligated oligonucleotide in both chemical and enzymatic approaches. 
Nevertheless, in the chemical procedure, the tag used was an oligodeoxyribonucleotide which 
was linked to the cap of the mRNA whereas in tlie enzymatic ligation, the tag was a chimeric 
hemi 5'DNA/RNA3' oligonucleotide which was ligated to the end of decapped mRNA as 

1 S described in example 12. 

Following attachment of the oligonucleotide tag to the mRNA by dther the chemical 
or enzymatic methods, the integrity of the mRNA was examined by performing a Northern 
blot with 200 to SOO ng of mRNA using a probe complementary to the oligonucleotide tag 
before p^orming the first strand synthesis as described in example 14. 

20 

EXAMPLE 14 

cDNA Synthesis Using mRNA Templates Having Intact 5^ Ends 
For the mRNAs joined to oligonucleotide tags using both the chemical and enzymatic 
methods, first strand cDNA synthesis was performed using the Superscript IT (Qbco BRL) or 

25 the Rnase H Minus M-MLV (Promega) reverse transcriptase with random nonamers a$ 
primers. In order to protect internal EcoRI sites in the cDNA from digestion at later steps in 
the procedure, methylated dCTP was used for first strand synthesis. After removal of RNA 
by an alkaline hydrolysis, the first strand of cDN A was precipitated using isopropanol in order 
to eliminate residual primers. 

30 For both the chemical and the enzymatic methods, the second strand of the cDN A 

was synthesized with a Klenow fi-agment using a primer correspondii^ to the 5' end of xrx 
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ligated oligonucleotide described in Example 12. Preferably, the primer is 20-25 bases in 
length. Methylated dCTP was also used for second strand synthesis in order to protect 
internal EcoRI sites in the cDNA from digestion during the cloning process. 

Following cDNA synthesis, the cDNAs were cloned into pBlueScript as described in 
S Example IS below. 

EXAMPLE 15 

Cloning of cDNAsderived fr om mRNA with intact ends into BlueScript 
FoHowii^ second strand synthesis, the ends of the cDNA were blunted with T4 DNA 

10 polymerase (Biolabs) and the cDNA was cfigested with EcoRI. Since methylated dCTP was 
used during cDNA synthesis, the EcoRI site present in the tag was the only hemi-methylated 
site, hence the only site suscq)tible to EcoRI cUgesdon. The cDNA was then size fractionated 
using exchision diromatography (AcA, Biosepra) and fractions corresponding to cDNAs of 
more than ISO bp were pooled and ethanol precipitated. The cDNA was directionally doned 

15 into the Smal and EcoRI ends of the phagemid pBlueScript vertor (Stratagcne). The ligation 
mixture was electroporated into baaeria and propagated under appropriate antibiotic 
selecdon. 

Clones containing the oligonucleotide tag attached were then selected as described in 
Example 16 below. 

20 

EXAMPLE 16 

Sdecrion of Clones Having the Oligonucleotide Tag Attached Thereto 
The plasmid DNAs containing 5' EST libraries made as described above were 
purified (Qiagen). A positive selection of the tagged clones was performed as follows 

25 Briefly, in this selection procedure, the plasnud DNA was converted to single stranded DN A 
using gene II endonuclease of the phage Fl in combination with an exonudease (Chang et 
Gene 127:95-8, 1993) such as exonudease ffl or T7 gene 6 exonudease. The rcsuh:nj 
single stranded DNA was then purified using paramagnetic beads as described by Fry et a: 
Biotechftiques, 13: 124-131, 1992. In this procedure, the single stranded DNA \\ai 

30 hybridized with a biotinylated oligonucleotide having a sequence corresponding to the 3' er t 
of the oligonucleotide described in Example 13. Preferably, the primer has a length of 20 * 



wo 99/06549 



PCT/IB98/0123t 



29 

bases. Clones including a sequence complementary to the biotinylated oligomicleotide were 
captured by incubation with sireptavidin coated magnetic beads followed by magnetic 
selection. After capture of the positive clones, the plasmid DNA was released from the 
magnetic beads and converted into double stranded DNA using a DNA polymerase such as 
5 the ThermoSequenase obtained from Amersham Phannada Biotech. Akemativety, 
protocoles such as the one described in the Gene Trapper kit available from Qbco BRL may 
be used. The double stranded DNA was then electroporated into bacteria. The percentage of 
positive clones having the 5' tag oligonucleotide was estimated to typically rank between 90 
and 9S% using dot bloc analysts. 
10 Following electroporatk>n, the libraries were ordered in 384-microtiter plates (MTP). 

A copy of the MTP was stored for future needs. Then the libraries were transferred into 96 
MTP and sequenced as described below. 

EXAMPLE 17 

15 Sequencing of Inserts in Selected Clon^ 

Plasmid inserts were first amplified by PCR on PE 9600 thennocyclers (Peridn-Elmer, 
Applied Biosystems Diviaon, Foster City, CA), using standard SETA-A and SETA-B 
primers (Genset SA), AmpliTaqGoU OPcridn-Elmer). dNTPs (Boehrii^erX buffer and cycling 
conditions as recommended by the Peridn-Elmer Corporation. 

20 PCR produas were then sequenced using automatic ABI Prism 377 sequencers 

(Perfcin Elmer). Sequencing reactions were performed using PE 9600 thermocyclers with 
standard dye-primer chemistry and ThermoSequenase (Amersham Pharmacia Biotech). The 
primers used were cither T7 or 21M13 (available from Genset SA) as appropriate. The 
primers were labeled with the JOE, FAM, ROX and TAMRA dyes. The dNTPs and ddNTPs 

25 used in the sequencing reactions were purchased from Boehringer. Sequencing buffer, 
reagent concemrauons and cycling conditions were as reconunended by Amersham. 

Following the sequencing reaaion, the samples were precipitated with ethanoL 
resuspended in formamide loading buffer, and loaded on a standard 4% acrylamide gel. 
Electrophoresis was performed for 2.5 hours at 3000V on an ABI 377 sequencer, and 

30 the sequence data were collected and analyzed using the ABI Prism DNA Sequencing 
Analysis Software, version 2. 1 .2. 
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2. Computer analysis of the Obtained 5* ESTs: Construction of NetGene and SignatTag 
databases 

The sequence data from the 44 cDNA libraries made as described above were 
transferred to a proprietary database, where quality control and validation steps were 

5 performed. A proprietary base-caller, working using a Unix system, automatically flagged 
suspect peaks, taking into account the shape of the peaks, the inter-peak resolution, and the 
noise level. The proprietary base<aUer also pcrfonned an automatic trinuning. Any stretch 
of 25 or fewer bases having more than 4 suspea peaks was considered unreliable and was 
discarded. Sequences corresponding to cloning veaor or ligation oligonucleotides were 

D automatically removed from the EST sequaiccs. However, the resulting EST sequences may 
contain 1 to S bases belonging to the above mentioned sequences at their 3' end. If needed, 
these can easily be removed on a case to case basis. 

Following sequencing as described above, the sequences of the 5' ESTs were entered 
in NetGene™, a proprietary database called for storage and manipulation as described below. 

S It will be appreciated by those skilled in the art that the data could be stored and manipulated 
on any medium \^ch can be read and accessed by a computer. Computer readable media 
include magnetically, optically, or electronically readable media. For example, the computer 
readable n^dia may be a hard disc, a floppy disc, a magnetic tape, CD-ROM, RAM, or ROM 
as well as other types of other mecfia known to those skilled in the art. 

[) In addition, the sequence data may be stored and manipulated in a variety of data 

processor programs in a diversity of formats. For instance, the sequence data may be stored 
as text in a word processing file, such as Microsoft WORD or WORDPERFECT or as an 
ASCn file in a variety of database programs ^miliar to those of skill in the art, such as DB2, 
SYBASE, or ORACLE. 

5 The computer readable media on which the sequence information is stored may be in 

a personal computer, a network, a server or other computer systems known to those skilled in 
the art. The computer or other system preferably includes the storage media described above, 
and a processor for accessing and manipulating the sequence data. Once the sequence data 
has been stored, it may be manipulated and searched to locate those stored sequences which 

0 contain a desired nucleic acid sequence or which encode a protein having a particular 
functional domaia For example, the stored sequence information may be compared :o other 
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known sequences to identify homolc^es, motifs implicated in biological function, or 
structural motifs. 

Programs whidi may be used to search or compare the stored sequences include the 
MacPattem CEMBL), BLAST, and BLAST2 program series (NCBI), basic local alignment 
S search tool programs for nucleotide (BLASTN) and peptide (BLASTX) comparisons 
(Altschul et al, J. MoL BioL 215: 403, 1990) and FASTA (Pearson and Lipmaa Proc. NatL 
Acad Set. 85: 2444, 1988). The BLAST programs then extend the alignments on the 
basis of defined match and mismatch criteria. 

Motifs which nu^ be detected using the above programs and those described in 

10 Example 28 include sequences encoding leucine zippers, helix-tum-helix motifi, glycosylation 
sites, ubiquitination sites, alpha helices, and beta sheets, signal sequences encoding signal 
peptides which direct the secretion of the encoded proteins, sequences implicated in 
transcription r^ulation such as homeoboxes, acidic stretches, enzymatic active siles, 
substrate binding sites, and enzymatic cleavage sites. 

15 Before searching the cDNAs in the NetGene™ database for sequence motifs of 

interest. cDNAs derived from mRNAs which were not of interest were identified and 
eliminated from fiirther consideration as described in Example 1 8 below. 

EXAMPLE 18 

20 . Elimination of Undesired Seq uences from Further Consideration 

5' ESTs in the NetGene™ database which were derived fit)m undeared sequences 
such as transfer RNAs, ribosomal RNAs, mitochondrial RNAs, prokaryodc RNAs, fiingal 
RNAs» Alu sequences, LI sequences, or repeat sequences were identified using the FASTA 
and BLASTN programs with the parameters listed in Table I. 

25 To eliminate 5' ESTs encoding tRNAs from fiirther consideration, the 5' EST 

sequences were compared to the sequences of 1 190 known tRNAs obtained from EMBL 
release 38, of which 100 were human. The comparison was performed using FASTA on 
both strands of the 5' ESTs. Sequences having more than 80% homology over more than 60 
nucleotides were identified as tRNA Of the 144,341 sequences screened, 26 were ideniined 

30 as tRNAs and eliminated from further consideration. 
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To diminate S' ESTs encoding rRNAs from further consideraiion, the S' EST 
sequences were compared to the sequences of 2497 known rRNAs obtained from EMBL 
release 38, of which 73 were human. The comparison was performed using BLASTN on 
both strands of the 5' ESTs with the parameter 8=108. Sequences having more than SCW'i 
5 homology over stretches longer than 40 nucleotides were identified as rRNAs. Of the 
i 44, 341 sequences screened, 3,312 were identified as rRNAs and eliminated from further 
consideration. 

To eliminate 5' ESTs encoding mtRNAs from further consideration, the 5' EST 
sequences were compared to the sequences of the two icnown mitochondrial genomes for 

10 which the entire genomic sequences are available and all sequences transcribed from these 
mitochondrial genomes including tRNAs, rRNAs. and mRNAs for a total of 38 sequences. 
The comparison was performed using BLASTN on both strands of the S' ESTs with the 
parameter S-108. Sequences having more than 80% homology over stretches longer than 40 
nucleotides were identified as mtRNAs. Of the 144,341 sequences screened, 6,110 were 

1 S identified as mtRNAs and eliminated from fiirther consideration. 

Sequences which nnght have resulted from exogenous contaminants were eliminated 
from further consideration by comparing the 5' EST sequences to release 46 of the EMBL 
bacterial and fiingal divisions using BLASTN with the parameter S=144. All sequences 
having more than 90% homology over at least 40 nucleotides were identified as exogenous 

20 contaminants. Of the 42 cDNA libraries examined, the average percentages of prokaryotic 
and fiingal sequences contained therein were 0.2% and O.S% respeaivety. Among these 
sequences, only one could be identified as a sequence specific to fungi. The others were 
dther fiingal or prokaryotic sequences having homologies with vertebrate sequences or 
indudir^ repeat sequences which had not been masked during the electronic comparison. 

2S In addition, the S' ESTs were compared to 6093 Alu sequences and II IS LI 

sequences to mask S' ESTs containing such repeat sequences. 5' ESTs inckiding THE ard 
MER repeats, SSTR sequences or satellite, micro-satellite, or telomeric repeats were ai-^ 
eliminated from further consideration. On average^ 11.5% of the sequences in the libranes 
contained repeat sequences. Of this 11. 5%, 7% contained Alu repeats, 3.3% contained LI 

30 repeats and the remaining 1.2% were derived from the other screened types of repciit 
sequences. These percentages are consistent with those found in cDNA libraries prepare j t . 
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Other groops. Tor example, the cDNA Kbraries of Adams et ai contained between 0% and 
7.4% Alu repeatii depending on the source of the RNA which was used to prepare the cDNA 
library (Adams et ai. Nature 377: 174, 1996). 

S The sequences of those 5' ESTs remaining after the elimination of undesirable 

sequences were compared with the sequences of known human mRNAs to determine the 
accuracy of the sequencing procedures described above. 

EXAMPLE 19 

10 Measurement of Sequencing Accuracy bv Comaarison to Known f^i^ yi^nrp^ 

To fijrther determine the accuracy of the sequencing procedure described above, the 
sequaices of 5' ESTs derived from known sequences were identified and compared to the 
original known sequences. First, a FASTA analysis with overhangs shorter than 5 bp on botli 
ends was conducted on the 5' ESTs to identify those matching an entry in the public human 

15 mRNA database. The 6655 5' ESTs which matched a known human mRNA were then 
realigned with their cognate mRNA and dynamic programming was used to include 
substitutions, insertions, and deletions in the list of -errors" which would be recognized 
Errors occurring in the last 10 bases of the 5' EST sequences were ignored to avoid the 
inclusion of spurious cloning sites in the analysis of sequencing accuracy. 
20 This analysis revealed that the sequences incorporated in the NetGcne™ database had 

an accuracy of more than 99. S%. 

To determine the efficiency with which the above selection procedures select 
cDNAs whidi include the S' ends of their corresponding mRNAs, the following analysis 
25 was performed. 

EXAMPLE 20 
Determination of Efficiency of 5' EST Selection 
To detenmine the eflSciency at which the above selection procedures isolated 5' ESTs 
30 which included sequences close to the 5' end of the mRNAs from which they deriveri 

sequences of the ends of the 5' ESTs derived from the elongation fector 1 subunit a ar.j 
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ferritin heavy chain genes were compared to the known cDNA sequences of these genes 
Since the transcriptiun start sites of both genes are well characterized, they may be used to 
determine the percentage of derived S' ESTs which included the authentic transcription stan 
sites. 

5 For both genes, more than 95% of the obtained 5* ESTs actually included sequences 

close to or upstream of the 5' end of the corresponding mRNAs. 

To extend the analysis of the reliability of the procedures for isolating 5' ESTs from 
ESTs in the NetGene™ database, a similar analysis was conducted using a database 
composed of human mRNA sequences extracted from GenBank database release 97 for 

1 0 comparison. The 5' ends of more than 85% of 5' ESTs derived from mRNAs included in the 
GeneBank database were located dose to the 3' ends of the known sequence. As some of 
the mRNA sequences available in the GenBank database are deduced from genomic 
sequences, a 5' end matching with these sequences will be counted as an internal match. 
Thus, the method used hane underestimates the yield of ESTs including the authentic 5' ends 

15 oftheir corresponding mRNAs. 

The EST libraries made above included mukiple 5' ESTs derived from the same 
mRNA The sequences of such 5' ESTs were compared to one another and the longest 
5' ESTs for each mRNA were identified. Overiapping cDNAs were assembled into 
20 continuous sequences (contigs). The resulting continuous sequences were then 
compared to public databases to gauge their similarity to known sequences, as described 
in Example 21 below. 

EXAMPLE 21 

25 Clustering of the 5' ESTs and Calculation of Noveltv Indices for cDNA Ubraries 

For each sequmced EST library, the sequences were clustered by the 5' end. Each 
sequence in the library was compared to the others with BLASTN2 (direa strand, parameters 
S=107). ESTs with Scoring Segment Pairs (HSPs) at least 25 bp long, having 95% 
identical bases and beginning closer than 10 bp from each EST 5' end were grouped. The 

30 longest sequence found in the cluster was used as representative of the group. A global 
clustering between libraries was then performed leading to the definition of super-contigs. 
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To assess the yield of new sequences within the EST Kbraries, a novelty rate (MR) 
was deOiied as: NR= 100 X (Number of new unique sequences found in the library,TotaI 
number of sequences from the library). Typically, novelty rating ranged between 10% and 
41% depending on the tissue from which the EST library was obtained. For most of the 
5 libraries, the random sequencing of 5' EST libraries was pursued until the novelty rate 
reached 20%. 

Following characterization as described above, the collection of 5' ESTs in 
NetGene™ was screened to identify those 5' ESTs bearing potential signal sequences as 
1 0 described in Example 22 below. 

EXAMPLE 22 

Identification of Potentiai Signal Sequences in 5' ESTs 
The 5' ESTs in the NetGene™ database were screened to identify those havii^ an 

1 5 uninterrupted open reading frame (ORF) longer than 45 nucleotides b^inning with an ATG 
codon and extending to the end of the EST. Approximately half of the cDNA sequences in 
NetGene™ contained such an ORF. The ORFs of these 5' ESTs were then searched to 
identify potential signal motifi using slight modifications of the procedures disclosed in Von 
Hdjne. Nucleic Acids Res. 14:4683-4690, 1986, the disclosure of which is incorporated 

20 Jierein by reference. Those 5' EST sequences encoding a stretch of at least 13 amino acid 
long with a score of at least 3.5 in the Von Heijne signal peptide identification matrix were 
considered to possess a signal sequence. Those 5' ESTs which matched a known human 
mRNA or EST sequence and had a 5' end more than 20 nucleotides downstream of the 
known 5' end were exchided from further analysis. The remaining cDNAs having signal 

23 sequences therein were included in a database called SignalTag*^. 

To confirm the accuracy of the above method for identifying signal sequences, 
the analysis of Example 23 was performed. 



30 
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EXAMPLE 23 

Confirmation of Acairacv of Identification of P otential Signal Seoucsnces in !? < ;Ty 
The accuracy of the above procedure for identifying signal sequences encoding signal 
peptides was evaluated by applying the method to the 43 amino acids located at the N 
5 terminus of all human SwissProt proteins. The computed Von Hdjnc score for each protein 
was compared with the known characterization of the protein as being a secreted protein or a 
non-secreted protein. In this manner, the number of non-secreted proteins having a score 
higher than 3.5 (false positives) and the number of secreted proteins having a score lower 
ttian 3.5 (false negatives) could be calculated. 
10 Using the results of the above analysis, the probability that a peptide encoded by the 

5' region of the mRNA is in fact a genuine signal peptide based on its Von Heijne's score was 
calculated based on either the assumption that 10% of human proteins are secreted or the 
assumption that 20% of human proteins arc secreted. The results of this analysis are shown 
in Figure 2 and table IV. 

15 Using the above method of identification of secretory proteins, 5' ESTs of the 

following polypeptides known to be secreted were obtained: human glucagon, gamma 
interferon induced nK>nokine precursor, scaeted cydophilin-like protein, human pleioux)pin, 
and human biotinidase precursor. Thus, the above method successfully identified those 5' 
ESTs which encode a signal peptide. 

20 

To confinn that the signal peptide encoded by the 5' ESTs actually functions as a 
signal peptide, the signal sequences firom the 5' ESTs may be cloned into a vector designed 
for the identification of signal peptides. Such vectors are designed to confer the ability to 
grow in selective medium only to host cells containing a vector with an operably linked signal 

25 sequence. For example, to confirm that a 5' EST encodes a genuine signal peptide, the signal 
sequence of the 5' EST may be inserted upstream and in frame with a non-secreted form of 
the yeast invertase gene in signal peptide selection vectors such as those described in U.S 
Patent No. 5,536,637, the disdosure of which is incorporated herein by reference. Growth of 
host cells containing signal sequence selection vectors with the correctly inserted 5* EST 

30 signal sequ^ice confirms that the 5' EST encodes a genuine agnal peptide. 
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Altemativdy, the presence of a signal peptide may be confinned by clonii^ the 
extended cDNAs obtained using the ESTs into expression vectors such as pXTI (as 
described below in example 30), or by constructing pronioter-signal sequence-reponer gene 
vectors which encode fusion proteins between the signal peptide and an assayable reporter 

5 protein. After introduction of these vectors into a suitable host cell, such as COS cells or 
NIH 3T3 cells, the growth medium nay be harvested and analyzed for the presence of the 
secreted protein. The medium from these cdls is compared to the medium torn control cells 
containing vectors lacking the signal sequence or extended cDNA insert to identify veaors 
whidi encode a functional signal peptide or an authentic secreted proteia 

0 Those 5' ESTs which encoded a signal peptide, as determined by the method of 

Example 22 above, were further grouped into four categories based on their homology 
to known sequences as described in Example 24 below. 



EXAMPLE 24 

1^ Categorization of 5^ ESTs E ncoding a Signal Peptide 

Those 5' ESTs having a sequence not matdiing any known vertebrate sequence nor 
any publicly available EST sequence were designated "new." Of the sequences in the 
SignalTag™ database, 947 of the 5' ESTs having a Von Heijne's score of at least 3.5 fell into 
this cat^oiy. 

20 Those 5' ESTs having a sequence not matchii^ any vertebrate sequence but matching 

a publidy known EST were designated "EST-ext". provided that the known EST sequence 
was extended by at least 40 nucleotides in the 5' direction. Of the sequences in the 
SignalTag™ database, 150 of the 5' ESTs having a Von Heijne's score of at least 3.5 feU into 
this cat^ory. 

25 Those ESTs not matching any vertebrate sequence but matching a publicly known 

EST without extending the known EST by at least 40 nucleotides in the 5' direction were 
designated "EST." Of the sequences in the SignalTag™ database, 599 of the 5' ESTs having 
a Von Heijne's score of at least 3.5 fell into this cat^ory. 

Those 5' ESTs matching a human mRNA sequence but extending the know^i 

30 sequence by at least 40 nucleotides in the 5' direaion were designated "VERT-exi." Of the 
sequences in the SignalTag™ database, 23 of the 5' ESTs having a Von Heijne's score ol ii 



wo 99/06549 



PCT.lB98/0i231 



least 3.5 fell into this category. Included in this category was a 5' EST which extended the 
known sequence of the human translocasc mRNA by more than 200 bases in the S' direction. 
A 5' EST wliich axtcndcd Uic sequence ofa human tumor suppressor gene in tte S' direction 
was also identiiicd. 

5 Tabic V shows the distribution of 5' ESTs in each category and the number of 5' 

ESTs in each category having a given minimum von Heijnc's score. 

3 Evaluation of Spatial and Temporal Expres sion of mRNlAs Corresponding to the 
S'ESTs or Extended cDNAs 

10 

Each of the 5' ESTs was also categorized based on the tissue from which its 
corresponding niRNA was obtained, as described below in Example 25. 

EXAMPLE 25 

15 Categorization of Expression Patterns 

Table VI shows tlie distribution of 5' ESTs in each of the above defined category 
with respect to the tissue from which the 5'ESTs of the corresponding mRNA were obtained 
Table II provides the sequence identification numbers of 5' EST sequences derived 
fix)m testis and other tisssues, the categories in which these sequences fall, and the von 

20 Heijne's score of the signal peptides which they encode. The 5' EST sequences and the amino 
add sequences they encode are provided in the appended sequence listings. Table 111 
provides the sequence E> numbers of the S' ESTs and the sequences of the signal peptides 
which they encode. The sequences of the 5' ESTs and the polypeptides they encode arc 
provided in the sequence fisting appended hereto. 

25 The sequences of DNA SEQ ED NOs: 38-270 can readily be screened for any errors 

therein and any sequence ambigwties can be resolved by resequencing a fragment containing 
such errors or amibiguities on both strands. Such fragments may be obtained from the 
plasmids stored in the inventors* laboratory or can be isolated using the techniques describee 
herein. Resolution of any such ambiguities or errors may be facilitated by using primers 

30 which hybridize to sequences located close to the ambiguous or erroneous sequences. For 
example, the primers may hybridize to sequences within 50-75 bases of the amibiguity or 
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error. Upon resoludon of an error or ambiguity, the corresponding corrections can be made 
in the protein sequences encoded by the DNA containing the error or amibtgutcy. 

In addhion to categorizing die 5' ESTs with respect to their tissue of origin, the 
spatial and temporal expression patterns of the mRNAs corresponding to the 5' ESTs, as well 
5 as their expression levds, may be determined as described in Example 26 below. 
Characterization of the spatial and temporal expression patterns and expresaon levels of these 
mRNAs is useful for constructing expression vectors capable of producing a desired level of 
gene product in a desired spatial or temporal manner, as will be discussed b more detail 
bdow. 

10 Furthermore, S' ESTs whose corresponding mRNAs are associated with disease 

states may also be identified. For exanq^le, a particular disease may result from the lack of 
expression, ova expresaon, or under expression of an mRNA corresponding to a S' EST. 
By conq)aring mRNA expression patterns and quanddes in samples taken from healthy 
individuals with those from individuals sufiertng fit)m a particular disease, 5' ESTs 

15 responsible for the disease may be identified. 

It will be appreciated that the results of the above characterization procedures for 
5' ESTs also apply to extended cDNAs (obtainable as described below) which contain 
sequences adjacent to the 5' ESTs. It will also be appreciated that if desired, 
20 characterization may be delayed until extended cDNAs have been obtained rather than 
characterizing the ESTs themselves. 

EXAMPLE 26 
Evaluation of Expression Level s and Patterns of mRNAs 
25 Corresponding to 5' ESTs or Extended cDNAs 

Expression levds and patterns of mRNAs corresponding to 5' ESTs or extended 
cDNAs (obtainable as described below in example 27) may be analyzed by solution 
hybridization with long probes as described in International Patent Application No. WO 
97/05277, the entire contents of which are hereby incorporated by reference. Briefly, a 5' 
30 EST, extended cDNA, or fragment thereof corresponding to the gene encoding the mRNA to 
be charaaerized is inserted at a cloning site immediately downstream of a bacteriophage (T3, 
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T7 or SP6) RNA polymerase promoter to produce antisense RNA. Preferably, the 5' EST or 
extended cDNA has 100 or more nucleotides. The plasmid is linearized and transcribed in the 
presence of ribonucleotides comprising modified ribonucleotides (i.e. biotin-UTP and DIG- 
UTP). An excess of this doubly labeled RNA is hybridized in solution with mRNA isolated 
S firom cells or tissues of interest. The hybridizations are performed under standard stringent 
conditions (40-50*C for 16 hours in an 80% formamide. 0.4 M NaCl buflFer, pH 7-8). The 
unhybridized probe is removed by digestion with ribonucleases ^edfic for sii^e-stranded 
RNA (Le. RNases CL3, Tl, Phy M, U2 or A). The presence of the biotin-UTP modification 
enables o^ture of the hybrid on a microtitration plate coated with streptavidin. The presence 

10 of the DIG modification enables the hybrid to be detected and quantified by ELISA using an 
anti-DIG antibody coupled to alkaline phosphatase. 

The 5' ESTs, extended cDNAs, or Segments thereof may also be tagged with 
nucleotide sequences for the serial analysis of gene expression (SAGE) as disclosed in UK 
Patent Application No, 2 305 241 A, the entire contents of which are incorporated by 

IS reference. In this method, cDNAs are prepared fi-om a cell, tissue, organism or other source 
of nucleic acid for which gene expression patterns must be determined. The resulting cDNAs 
are separated into two pools. The cDNAs in each pool are cleaved with a first restriction 
endonuclease, called an anchoring enzyme, having a recognition site which is likely to be 
present at least once in most cDNAs. The fi-agments which contain the S' or 3' most r^on 

20 of the cleaved cDNA are isolated by binding to a capture medium such as streptavidin coated 
beads. A first oltgonudeotide linker having a first sequence for hybridization of an 
amplification primer and an internal restriction site for a so-called tagging endonudease is 
ligated to the digested cDNAs in the first pool. Digestion witii the second endonudease 
produces short tag fragments fiom the cDNAs. 

25 A second oligonucleotide having a second sequence for h>tridization of an 

amplification primer and an internal restriction site is ligated to the digested cDNAs in the 
second pool. The d>NA Segments in die second pool are also digested with the tagging 
endonudease to generate short tag fragments derived from the cDNAs in the second pool. 
The tags resulting fit)m digestion of the first and second pools with the anchoring enzyme and 

30 the tagging endonudease are ligated to one another to produce so-called ditags. In some 
embodiments, the ditags are concatamerized to produce ligation products containing from 2 
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to 200 ditags. The tag sequences are then determined and compared to the sequences of the 
5' ESTs or extended cDNAs to determine which 5' ESTs or extended cDNAs are expressed 
in the cell, tissue organism, or other source of nucleic adds from which the tags were 
derived. In diis way, the expression pattern of the 5' ESTs or extended cDNAs in the cell, 
S tissue, organism, or other source of nudetc acids is obtained. 

Quantitative analysis of gene expression may also be performed using arrays. As used 
herein, the term array means a one dimcnaonal, two dimensional, or multidimensional 
arrangement of full length cDNAs (Le, extended cDNAs which include the coding sequence 
for the signal pq)tide, the coding sequence for the mature protein, and a stop codon), 

10 extended cDNAs, 5' ESTs or fragments diereof of sufiBcient length to permit specific 
detection of gene expressioa Preferably, the fragments are at least 15 nucleotides in length. 
More preferably, the fragments are at least 100 nucleotide long. More preferably, the 
fragments are more than 100 nucleotides in length. In some embodiments, the fragments may 
be more than SOD nucleotide long. 

IS For example, quantitative analysis of gene expression may be performed with full 

length cDNAs as defined below, extended cDNAs, 5' ESTs, or fragments thereof in a 
complementary DNA microarray as described by Schena et al. (Science 270:467-470, 1995; 
Proc. NatL Acad Sci. U.SjL 93:10614-10619, 1996). FuU length cDNAs, extended cDNAs. 
5' ESTs or fragments thereof are amplified by PCR and arrayed from 96-well microtiier 

20 _plates onto silylated microscope slides using high-speed robodcs. Printed arrays are 
incubated in a humid chamber to allow rehydration of the array elements and rinsed, once in 
0.2% SDS for 1 min, twice in water for 1 min and once for 5 min in sodium borohydride 
soludoa The arrays are submerged in water for 2 min at 95T, transferred into 0.2% SDS 
for 1 min, rinsed twice with water, air dried and stored in the dark at 25''C 

25 Cell or tissue mRNA is isolated or commercially obtained and probes are prepared by 

a sir^e round of reverse transcription. Probes are hybridized to 1 cm^ microaxrays under a 
14 X 14 mm glass coverslip for 6-12 hours at 60°C. Arrays are washed for S min at 2S^C in 
low stringency wash buffer (1 x SSC/0.2% SDS), then for 10 min at room temperature in 
high stringency wash buflFer (0. 1 x SSC/0.2% SDS). Arrays are scanned in 0. 1 x SSC using a 

30 fluorescence laser scanning device fitted with a custom filter set. Accurate differential 
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expression measurements are obtained by taking the average of the ratios of two independent 
hybridizations. 

Quantitative analysis of the expression of genes may also be performed vnth fiiil 
length cDNAs, extended cDNAs, 5' ESTs, or fragments thereof in complementary DNA 
S arrays as described by Pietu et a/.. {Genome Research 6:492-503, 1996). The full length 
d)NAs, extended cDNAs, S' ESTs or fragments thereof are PCR amplified and sponed on 
membranes. Then, mRNAs originating fi'om various tissues or cells are labeled with 
radioactive nucleotides. After hybridization and washing in controlled conditions, the 
hybridized mRNAs are detected by phospho-imaging or autoradiography. Duplicate 
10 experiments are performed and a quantitative analysis of differentially expressed mRNAs is 
then performed. 

AhematK^y, expression analysis of the 5' ESTs or extended cDNAs can be done 
through high density nucleotide arrays as described by Lockhart et al. {Nature Biotechtology 
14: 1675-1680, 1996) and Sosnowsky etaL {Proa NatL Acad Set. 94:1119-1123. 1997). 

15 Oligcmucleotides of 15-50 nucleotides conesponding to sequences of the 5' ESTs or 
extended cDNAs are synthesized directly on the chip (Lockhart et aL, siq>ra) or synthesized 
and then addressed to the chip (Sosnowsky et aL, suprd). Preferably, the oligonucleotides 
are about 20 nucleotides in length. 

cDNA probes labeled with an appropriate compound, such as biotin, digoxigenin 

20 or fluorescent dye, are synthesized from the appropriate mRNA population and then 
randomly fragmented to an average size of 50 to 100 nucleotides. The said probes are 
then hybridized to the chip. After washing as described in Lockhart et al, sttpra and 
application of different electric fields (Sonowsky et al, supra.), the dyes or labeling 
compounds are detected and quantified Duplicate hybridizations are performed. 

25 Comparative analysis of the intensity of the signal originating from cDNA probes on the 
same target oligonucleotide in different cDN.A samples indicates a differential expression 
of the mKNA conesponding to the 5' EST or extended cDNA from which the 
oligonucleotide sequence has been designed 
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ffl. Use of 5' ESTs to Clone Extended cDNAs and to Gone the Corresponding 
Genomic DNAs 

Once 5' ESTs which include the 5' end of the conresponding mRNAs have been 
selected using the procedures described above, they can be utilized to isolate Gctended 
5 cDNAs which contain sequences adjacent to the 5' ESTs. The extended cDNAs may include 
the entire coding sequence of the protein encoded by die corresponding mRNA, including the 
authentic translation start site, the signal sequence, and the sequence encoding the mature 
protein remaining after cleavage of the signal peptide. Such extended cDNAs are referred to 
herein as "full length cDNAs." Alternatively, the extended cDNAs may include only the 

10 sequence encoding the mature protein remaining after cleavage of the signal peptide, or only 
the sequence encoding the signal peptide. 

Example 27 below describes a general method for obtaining extended cDNAs using 
5* ESTs. Exanq)le 28 below provides experimental results, using the method explained in 
example 27, describing several extended cDNAs tnchiding die entire coduig sequence and 

1 S authentic 5' end of the corresponding mRNA for several secreted proteins. 

The mediods of Examples 27, 28, and 29 can also be used to obtain extended cDN.As 
vMdi encode less than the entire coding sequence of the secreted proteins encoded by the 
genes corresponding to the 5' ESTs. In some embodiments, the extended cDNAs isolated 
using these methods encode at least 10 amino acids of one of the proteins encoded by the 

20 .sequences of SEQ ID NOs: 38-270. In fiirther embodiments, the extended cDNAs encode 
at least 20 amino adds of the protons encoded by the sequences of SEQ ID NOs: 38-270 In 
fiirther embodiments, the extended cDNAs encode at least 30 amino amino acids of the 
sequences of SEQ ID NOs: 38-270. In a preferred embodiment, the extended cDNAs 
encode a fijU length protein sequence, which includes the protein coding sequences of SEQ 

25 ID NOs: 38-270. 

EXAAVIPLE27 

general Method for Using 5' ESTs to Clone and Sequence cDNAs which Inchide the Entire 
Coding Region and th e Authentic 5^ End of the Corresponding mRNA 
30 The following general method has been used to quickly and efticiendy isolate 

extended cDNAs having the authentic 5' ends of their corresponding mRNAs as weil j> 
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the full protein coding sequence and including sequence adjacent to the sequences of the 
5' ESTs used to obtain them. This method may be applied to obtain extended cDNAs 
for any 5' EST in the NctGcnc^ database, including those 5' ESTs encoding 
polypeptides belonging to secreted proteins. The method is summarized in figure 3. 

S 

I Qbtention of Extended cDNAs 
a) First strand synthesis 

The method takes advantage of the known S' sequence of the mRNA. A reverse 
transcription reaction is conducted on purified mRNA with a poly 14dT primer containing a 

1 0 49 nucleotide sequence at Its end allowing the addition of a known sequence at the end of 
the cDNA which corresponds to the 3' end of the mRNA. For example, the primer may luive 
Uie following sequence: S'-ATC GTT GAG ACT CGT ACC AGO AGA GTC ACG AGA 
GAG ACT ACA CGG TAC TGG TTT TTT TTT TTT TTVN -3' (SEQ ID NO: 14). Those 
skilled in the art will appreciate that other sequences may also be added to the poly dT 

IS sequence and used to prime the first strand synthe^s. Using this primer and a reverse 
transcriptase such as the Superscript 11 (Qbco BRL) or Rnase H Minus M-MLV (Promcga) 
enzyme, a reverse transcript anchored at the 3' polyA site of the RNAs is generated. 

After removal of the mRNA hybridized to the first cDNA strand by alkaline 
hydrolysis, the products of the alkaline hydrolysis and the residual poly dT primer are 

20 eliminated with an exclusion column such as an AcA34 (Biosepra) matrix as explained in 
Example 11. 

b) Second strcaKl synthesis 

A pur of nested primers on each end is designed based on the known S' sequence 
from the 5' EST and the known 3' end added by the poly dT primer used in the first strand 
25 synthesis. Softwares used to design primers are either based on GC content and melting 
temperatures of oligonucleotides, such as OSP (lUier and Green, PCR Metk Appi. 1:124- 
128, 1991), or based on the octamer frequency disparity method (Griffais ei ai. Nucleic 
Acids Res. 19: 3887-3891, 1991) such as PC-Rare 
(httpi/bioinformatics. wciznMnn.ac.il/software/PC-Rare/doc/nmud 
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Pref erably, the nested primers at the 5' end are separated from one another by four to 
nine bases. The S' phmer sequences be selected to have meltii^ temperatures and 
spedfichies suitable for use in PGR. 

Preferably, the nested primers at the 3' end are separated from one another by four to 
5 nine bases. For example; the nested 3' primers may have the following sequences: (S'- CCA 
GCA GAG TCA CGA GAG AGA CTA CAC GG .3'(SEQ ID NO: 1 5), and 5'- C AC GAG 
AGA GAC TAC ACG GTA CTG G -3' (SEQ ID NO: 16). These primers were seleaed 
because they have melting temperatures and specificities compatible with their use in PCR. 
However, those skilled in the art will appreciate that other sequences may also be used as 
10 primers. 

The first PCR run of 25 cycles is performed using the Advantage Tth Polymerase ir 
Mix (Clontech) and the outer primer from each of the nested pairs. A second 20 cycle 
PCR using the same enzyme and the inner primer from each of the nested pairs is then 
performed on 1/2500 of the first PCR product. Thereafter, the primers and nucleotides 
15 are removed. 



2. Sequencing of FuU Length Extended cDNAs or Fragments Thereof 

Due to the lack of position constraints on the deagn of 5' nested primers 

compatible for PCR use using the OSP software, amplicons of two types are obtained 
20 Preferably, the second 5' primer is located upstream of the translation initiation codon 

thus yielding a nested PCR product containing the whole coding sequence. Such a full 

length extended cDNA undergoes a direct cloning procedure as described in section a. 

However, in some cases, the second 5' primer is located downstream of the translation 

initiation codon, thereby yielding a PCR product containing only part of thjc ORF. Such 
25 incomplete PCR products are submitted to a modified procedure described b section b. 

a) Nested PCR products containing complete ORFs 

When the resulting nested PCR product contains the complete coding sequence, as 

predicted from the S'EST sequence, it is cloned in an appropriate vector such as pED6dpc2, 

as described in secdon 3. 



30 
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h) Nested PCR products containing incompfefe ORFs 

When the ampiicon does not contain the cottiplcte coding sequence, intermediate 
steps arc necessary to obtain both the complete coding sequence and a PGR product 
containing the Tull coding sequence. The complete coding sequence can be assembled 
S Trom several partial sequences determined directly from diflcrent PCR products as 
described in the following section. 

Once the Tull coding sequence has been completely determined, new primers 
compatible for PGR use are designed to obtain amplicons containing the whole coding 
region. However, in such cases, 3* primers compatible for PGR use are located inside the 

10 3* UTR of the corresponding mRNA. thus yielding amplicons which lack part of this 
region, i.e. the polyA tract and sometimes the polyadenylation signal, as illustrated in 
figure 3. Such full length extended cDNAs are then cloned into an appropriate vector as 
described in section 3. 
c) Sequencing extended cDNAs 

IS Sequencing of extended cDNAs is performed using a Die Terminator approach 

with the AmpliTaq DNA polymerase FS kit available from Perkin Elmer. 

In order to sequence PGR fragments, primer walking is performed using software 
such as OSP to choose primers and automated computer software such as ASMG (Sutton el 
ai. Genome Science Tedtml 1: 9-19, 1995) to construct contigs of walking sequences 

20 including the initial S' tag using minimum overiaps of 32 nucleotides. Preferably, primer 
walking is perfonmed until the sequences of full length cDN As are obtained. 

Gompletion of the sequencing of a given extended cDNA fragment is assessed as 
fbltows. Since sequences located after a polyA tract are difficult to determine precisely in the 
case of undoned products, sequencing and primer walking processes for PGR products are 

25 interrupted when a polyA tract is identified in extended cDNAs obtained as described in case 
b. The sequence length is compared to the size of the nested PGR produa obtained as 
described above. Due to the limited accuracy of the determination of the PGR product size 
by gel electrophoresis, a sequence is considered complete if the size of the obtained sequence 
is at least 70 % the size of the first nested PGR product. If the length of the sequence 

30 determined from the computer analysis is not at least 70% of the length of the nested PGR 
product, these PGR products are cloned and the sequence of the insertion is detennined. 
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When Northern blot data are available, the size of the mRNA detected for a given PGR 
product is used to finally assess that the sequence is complete. Sequences which do not fulfill 
the above criteria are discarded and will undergo a new isolation procedure. 

Sequence data of all extended cDNAs are then transferred to a proprietary 
S database, where quality controls and validation steps are carried out as described in 
example IS. 

3 Cloning of Full Length Extended cDNAs 

The PGR product containing the full coding sequence is then cloned in an appropriate 

10 veaor. For example, the extended cDNAs can be cloned into the expression vector 
pEP6dpc2 (DiscoverEase, Genedcs Institute, Cambridge, MA) as follows. pED6dpc2 
vector DNA is prepared with bhint ends by performing an EcoRI digestion followed by a fill 
in reactioa The Wunt ended vector is dephosphorylated. After removal of PGR primers and 
ethanol precipitation, the PGR product containing the fiill coding sequence or the extended 

1 S cDNA obtained as described above is phosphorylated with a kinase subsequently removed by 
phenol-Sevag extraction and precipitatioa The double stranded extended cDNA is then 
ligated to the vector and the resuhing expression plasmid introduced into appropriate host 
cells. 

Since the PGR products obtained as described above are blunt ended molecules that 
20 can be doned in either direction, the orientation of several clones for each PGR produa is 
detetnuned. Then, 4 to 10 clones are ordered in microtiter plates and subjected to a PGR 
reaction using a first primer located in the veaor close to the cloning site and a second primer 
located in Ae portion of the extended cDNA conresponding to the 3' end of the mRNA. This 
second primer may be the antisense primer used in anchored PGR in the case of direct cloning 
25 (case a) or the antisense primer located inside the 3'UTR in the case of indirea cloning (case 
b). Clones in which the start codon of the extended cDNA is operably linked to the promoter 
in the vector so as to permit expression of the protein encoded by the extended cDNA are 
conserved and sequenced. In addition to the ends of cDNA inserts, approximately 50 bp of 
vector DNA on each side of the cDNA insen are also sequenced. 
30 The cloned PGR products are then entirely sequenced according to the 

aforementioned procedure. In this case, contigation of long fragments is then performed 
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on walking sequences that have akeady contigated for uncloned PCR products during 
primer walking Sequencing of cloned amplicons is complete when the resulting contigs 
include the whole coding region as well as overlapping sequences with vector DNA on 
both ends. 

4 Computer analvsis of F uD Length Extended cDNA 

Sequences of all full length ectended cDNAs are then submitted to fiirther analy^s as 
described below. Before searching the extended full length cDNAs for sequences of interest, 
extended cDNAs which are not of interest (vector RNAs, transfer RNAs, ribosomal RNAs, 
mitochondrial RNAs, prokaryotic RNAs and fungal RNAs) are discarded using methods 
essentially similar to those described for S'ESTs in Example 18. 

a) Identification of stmctwral features 

Structural features, e.g. polyA tail and polyadenjdation signal, of the sequences of full 
length extended cDNAs are subsequendy determined as follows. 

A poiyA tail is defined as a homopolymeric stretch of at least 1 1 A with at most one 
alternative base within it The polyA tail search is restricted to the last 100 nt of die sequence 
and limited to stretches of 1 1 consecutive As because sequencing reactions are often not 
readable after such a polyA stretch. Stretches having more than 90% homology over 8 
nucleotides are identified as polyA tails using BLAST2N. 

To search for a polyadenylation signal, the polyA tail is clipped from the full- 
length sequence. The SO bp preceding the polyA tail are first searched for the canonic 
polyadenylation AAUAAA signal and, if the canonic signal is not detected, for the 
alternative AUUAAA signal (Sheets et al., Nuc. Acids Res. 18: 5799-5805, 1990) If 
neither of these consensus polyadenylation signals is found, the canonic motif is searched 
^ain allowing one mismatch to account for possible sequencing errors. More than 85 > 
of identified polyadenylation signals of either type actually ends 10 to 30 bp trom tne 
polyA tail. Alternative AUUAAA signals represents approximately 15 % of the iota: 
number of identified polyadenylation signals. 

b) Identification of functional features 

Funaional features, e.g. ORFs and signal sequences, of the sequences of full iengin 
extended cDNAs were subsequently determined as follows. 
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The 3 upper strand frames of extended cDNAs are searched for ORFs defined as the 
maximum length fragments beginning with a translation intiation codbn and ending with a 
stop codon. ORFs encoding at least 20 amino acids are preferred. 

Each found ORF is then scanned for the presence of a signal peptide in the first 
5 SO amino-acids or, where appropriate, within shoner regions down to 20 amino acids or 
less in the ORF, using the matrix method of von Heijne (M/c Acids Res. 14: 4683-4690, 
1986), the disclosure of which is incorporated herein by reference as described in 
Example 22. 

c) Homology to either mcleotidic or proteic sequences 
10 Categorization of fidl-ler^ sequences may be achieved using procedures essentially 

similar to those described for S*ESTs in Example 24. 

Extended cDNAs prepared as described above may be subsequently engineered to > . 
obtain nucleic acids which inchide desired portions of the extended cDNA using convoitional 

IS techniques such as subdoning, PCR, or in vitro oligonucleotide synthesis. For example, 
nucleic acids which include only the full coding sequences {i.e. the sequences encoding the 
signal peptide and the mature protein remaining after the signal peptide is cleaved off) may be 
obtained using techniques known to those skilled in the art. Alternatively, conventional 
techniques may be applied to obtain nucleic acids which contain only the coding sequences 

20 for the mature protein remaining after the signal peptide is cleaved oflf or nucleic adds which 
contain only the coding sequences for the signal peptides. 

Similariy, nuddc adds contaimng any other desired pordon of the coding sequences 
for the secreted protdn may be obtained. For example, the nucldc add may contain at least 
10 consecutive bases of an extended cDNA such as one of the extended cDNAs described 

2S below. In another embodiment, the nuddc acid may contain at least IS consecutive bases of 
an extended cDNA such as one of the extended cDN As described bdow. Altemativdy, the 
nucleic add may contain at least 20 consecutive bases of an extended cDNA such as one of 
the extended cDNAs described below. In another embodiment, the nucleic add may contain 
at least 25 consecutive bases of an extended cDNA such as one of the extended cDNAs 

30 described below. In yet another embodiment, the nucleic acid may contain at least 40 



wo 99/06549 



50 



FCT/IB9S/01231 



consecutive bases of an extended cDNA such as one of the extended cDNAs described 
below. 

Once an extended cDNA has been obtained, it can be secjuenced to detennine the 
amino add sequence it encodes. Once the encoded amino acid sequence has been 
5 determined, one can create and identify artf of the many conceivable cDN As that will encode 
that protein by simply uang the degeneracy of the genetic code. For example, allelic variants 
or other homologous nucleic acids can be identified as described below. Alternatively, 
nucleic acids encoding the desired amino acid sequence can be synthesized in vitro. 

In a preferred embodiment, the coding sequence may be selected using the known 
1 0 codon or codon pair preferences for the host organism in which the cDN A is to be expressed 

The extended cDNAs derived from the 5' ESTS of the present invention were ?* 
obtained as described in Example 28 below. 

EXAMPLE 2S 

15 Characterization of cloned extended cDNAs obtained using S' ESTs 

The procedure described in Example 27 above was used to obtain the extended 
cDNAs derived from the 5' ESTs of the present invention in a variety of tissues. The 
following list provides a few examples of thus obtained extended cDNAs. 

Using this approadi, the M length cDNA of SEQ ID NO: 17 Qntemal identification 
20 number 48-19-3-Gl-FLl) was obtained. This cDNA &lls into the "EST-cxt" category 
described above and encodes the signal peptide MKKVLLLITAILAVAVG (SEQ ID NO: 
IS) haviqg a von Heijne score of 8.2. 

The fiiU length cDNA of SEQ ID NO: 19 Cmtemal identification number 58-34-2-E7- 
FL2) was also obtained using this procedure, this cDNA &lis into the "EST-exf category 
25 described above and encodes the signal peptide MWWFQQGLSFLPSALVIWTSA (SEQ ID 
NO:20) having a von Heijne score of 5.5. 

Another fijU length cDNA obtained using the procedure described above has the 
sequence of SEQ ID N0:21 (internal identification number 51-27-1-E8-FL1). This cDNA 
falls into the "EST-ext** category described above and encodes the signal peptide 
30 hm-TTLPSANSANSPVNMPTTGPNSLSYASSALSPCLT (SEQ ID NO 22) having a 
von Heijne score of 5.9. 
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The above prcx:edure was also used to obtain a &11 length cDNA having the sequence 
oFSEQ ID NO:23 (internal identification number 76-4.1-G5-FLI). This cDNA falls into the 
"EST-cxt- category described above and encodes the signal peptide ILSTVTALTFAXA 
(SEQ ID NO:24) having a von Heijne score of 5.5. 

5 The fiill lei^h cDNA of SEQ ID NO:25 (internal identification number 5 1 -3-3-B 10- 

FL3) was also obtained using this procedure. This cDNA falls into the "new- category 
described above and encodes a signal peptide LVLTLCTLPLAVA (SEQ ID NO:26) having 
a von Heijne score of 10. 1. 

The fiill length cDNA of SEQ ID NO:27 Ontemal identification number 58-35-2-FlO- 

10 FL2) was also obtained using this procedure. This cDNA fidls into the **new*' category 
described above and encodes a signal peptide LWLLFFLVTAIHA (SEQ ID NO:28) having 
a von Heijne score of 10.7. 

Bacterial clones containing plasmids containing the full length cDNAs described 
above are presently stored in the inventor's laboratories under the internal identification 

1 5 numbers provided above. The inscns may be recovered fix>m the stored materials by growing 
an aliquot of the appropriate bacterial clone in the appropriate medium. The plasmid DNA 
can then be isolated mng plasmid isolation procedures ^miliar to those skilled in the art such 
as alkaline lysis minipreps or large scale alkalme lyas plasnud isolation procedures. If desired 
the plasmid DNA may be fiirthcr enriched by centrifiigation on a cesium ddoride gradient, 

20 size exclusion chromatograply, or anion exchange chromatography. The plasmid DNA 
obtained uang these procedures may then be manipulated using standard cloning techniques 
ftmiliartothoseskinedintheart. Alternatively, a PGR can be done with primers designed at 
both ends of the cDNA insertion. The PGR product which corresponds to the cDNA can 
then be manipuiated using standard cloning techniques familiar to those skilled in the art. 

25 The polypeptides encoded by the extended cDNAs may be screened for the presence 

of known stmctural or fiinctional motifs or for the presence of signatures, small amino acid 
sequences which are well conserved amongst the members of a protein family. The 
conserved regions have been used to derive consensus patterns or matrices included m the 
PROSITE data bank, in particular in the file prosite.dat (Release 13.0 of No\ ember 1995, 

30 located at http://expasy.hcuge.ch/sprot/prosite.html. Prosite^convert and prosiie^scan 
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programs (http://ulrec3.uniLch/ftpserveur/prosite_scan) may be used to find signatures on the 
extended cDNAs. 

For each pattern obtained with the prosite_convert program firom the prosite.dat file, 
the accuracy of the detection on a new protein sequence may be assessed by evaluating the 
S fiequency of irrelevant hits on the population of human secreted proteins included in the data 
bank SWISSPROT. The ratio between the number of hits on shu£Bed proteins (with a 
window size of 20 amino adds) and the number of hits on native (unshuffled) proteins may be 
used as an index. Every pattern for which the ratio is greater than 20% (one hit on shufQed 
proteins &>r 5 hits on native proteins) may be skipped during the search with prosite^scan. 
10 The program used to shufile protein sequences (db_shuffled) and the program used to 
determine the statistics for each pattern in the protein data banks (prosite_statistics) are 
available on the ftp site httD://ulrec3.unil.ch/ftpserveur/prosite scan . 

In addition to PGR based methods for obtaining extended cDNAs, traditional 
15 hybridization based methods may also be employed. These methods may also be used to 
obtain the genomic DNAs which encode the mRNAs from which the 5' ESTs were 
derived, mRNAs corresponding to the extended cDNAs, or nucleic acids which are 
homologous to extended cDNAs or 3' ESTs. Example 29 below provides examples of 
such methods. 

20 

EXAMPLE 29 

Methods for Obtaining cDNAs which include the Entire Coding Region and the Authentic 

S'End of the Corresponding mRNA 
A fill! length cDNA library can be made using the strategies described in Examples 
25 13, 14, 15, and 16 above by replacing the random nonamer used in Example 14 with an oligo- 
dT primer. For instance, the oligonucleotide of SEQ ID NO: 14 may be used. 

Alternatively, a cDNA library or genomic DNA library may be obtained from a 
commercial source or made using techniques familiar to those skilled in the art. Such cDNA 
or genomic DNA librairies may be used to isolate extended cDNAs obtained from 5' EST or 
30 nucleic acids homologous to extended cDNAs or 5' EST as follows. The cDNA library or 
genomic DNA library is hybridized to a detectable probe comprising at least 10 consecuti\: 
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nucleotides from the 5' EST or extended cDNA using convendonai techniques. Preferably, 
the probe comprises at least 12, 15, or 17 consecutive nucleotides from the 5' EST or 
extended cDNA More preferably, the probe comprises at least 20 to 30 consecutive 
nucleotides from the 5' EST or extended cDNA. In some embodiments, the probe comprises 
5 more than 30 nucleotides from the 5' EST or extended cDNA. 

Techniques for identifying cDNA dones in a cDNA library which hybridize to a given 
probe sequence are disclosed in Sambrook et aL, Moleatlar Cloning: A Laboratory Mamal 
2d Ed,, Cold Spring Harbor Laboratory Press. 1989, the disclosure of which is incorporated 

1 0 herein by reference. The same techniques may be used to isolate genomic DNAs. 

Briefly. cDNA or genomic DNA clones which hybridize to the detectable probe are 
identified and isolated for fiirtiier manipulation as foDows. A probe comprising at least 10 ** 
consecutive nucleotides from the 5' EST or extended cDNA is labeled with a deteaable label *' 
such as a radioisotope or a fluorescent molecule. Preferably, die probe comprises at least 1 2. 

IS 15, or 17 consecutive nucleotides from die 5' EST or extended cDNA. More preferably, the 
probe comprises 20 to 30 consecutive nucleotides from the 5' EST or extended cDNA. In 
some embodiments, the probe comprises more tiian 30 nucleotides from die 5' EST or 
extended cDNA. 

Tedmiques for labeling the probe are well known and include phosphorylation wiuh 
20 -polynucleotide kinase, nick translation, in vitro transcription, and non radioaaive techniques 
The cDNAs or genonric DNAs in the library are transferred to a nitrocellulose or nylon filter 
and denanired. After blocking of non specific sites, die filter is incubated with the labeled 
probe for an amotmt of thne sufficient to allow binding of the probe to cDNAs or genomic 
DNAs containing a sequence capable of hybridizing thereto. 
25 By varying the stringency of the hybridization conditions used to identify 

extended cDNAs or genomic DNAs which hybridize to the detectable probe, extended 
cDNAS having different levels of homology to the probe can be identified and isolated at 
described below. 

30 
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1. Identification of Extended cDNA or Genomic cDNA Sequences Having a High 
Degree of Homology to the Labeled Prob^ 

To identify extended cDNAs or genomic DNAs having a high d^ee of homology to 
the probe sequence, the melting temperature of the probe may be calculated using the 
5 following formulas: 

For probes between 14 and 70 nucleotides in length the melting temperature (Tm) is 
calculated using the formula: Tm=8l.5+16.6(log fNa+])+0.4 1 (fraction G+CK600/N) where 
N is the length of the probe. 

If the hybridization is carried out in a solution containing formamide, the melting 
10 temperature may be calculated using the equation Tm«81.5+16.6(l<^ [Na+])+0.41 (fraction 
GK:H0.63% fonnamideH600/N) where N is the length of the probe. 

Prehybridization may be carried out in 6X SSC, SX Denhardt's reagent, 0.5% SDS, 
100 iig doiatured fi'agmented salmon spenn DNA or 6X SSC, SX Denhardt's reagent, 0.5% 
SDS, 100 denatured fragmented salmon spenn DNA, 50% fonnamide. The fonnulas for 
1 5 SSC and Denhardt's solutions are listed in Sambrook et al. , supra. 

Hybridization is conducted by adding the detectable probe to the prehybridization 
sohitions listed above. Where the probe comprises double stranded DNA, it is denatured 
before addition to the hybridization solunon. The filter is contaaed with the hybridization 
solution for a sufficient period of time to allow the probe to hybridize to extended cDNAs or 
20 genomic DNAs containing sequences complementary thereto or homologous diereto. For 
probes over 200 nucleotides in length, the hybridization may be carried out at 15-25®C below 
the Tm For shorter probes, such as oligonucleotide probes, the hybridization may be 
conducted at I5-25''C bdow the Tm. Preferably, for hybridizations in 6X SSC, the 
hybridization is conducted at approximately 68®C. Preferably, for hybridizations in 50% 
25 formamide containing solutions, the hybridization is conducted at approximately 42°C. 

All of the foregoing hybridizations would be con^dered to be under "stringent" 
conditions. 

Following hybridization, the filter is washed in 2X SSC, 0.1% SDS at room 
temperature for 15 minutes. The filter is then washed with O.IX SSC, 0.5% SDS at room 
30 temperature for 30 minutes to I hour. Thereafter, the solution is washed at the hybridization 
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temperature in O.lX SSC, 0.5% SDS. A final wash b conduaed in O.IX SSC at room 
temperature. 

Extended cDNAs, nucleic acids homologous to extended cDNAs or 5' ESTs, or 
genomic DNAs which have hybridized to the probe are identified by autoradiography or 
5 other conventional techniques. 

2. Obtention of Extended cD NA or Genomic cDNA Sequences Having Lower n^ Ptf^^ 
of Homology to the Labeled Probe 

The above procedure may be modified to identify extended cDNAs. nucleic acids 

1 0 homologous to extended cDNAs, or genomic DNAs having decreasing levels of homology to 
the probe sequence. For example, to obtain extended cDNAs, nucleic adds homologous to 
extended cDNAs, or genomic DNAs of decreasing homology to the detectable probe, less 
stringent conditions xnay be used. For example, the hybridLzation temperature may be 
decreased in increments of 5X from 68X to 42T in a hybridization buffer having a sodium 

1 5 concentration of approximate^ IM. FoDowing hybridization, die filter may be washed with 
2X SSC, 0.5% SDS at the tenq)erature of hybridization. These conditions are considered to 
be "moderate" conditions above SOX and "low" conditions below 50*C. 

Alternatively, the hybridization may be carried out in buflFers, such as 6X SSC, 
containing fbrmamide at a temperature of 42^C. In this case, the concentration of formamide 

20 in the hybridization buflfer may be reduced in 5% increments from 50% to 0% to identify 
clones having decreasing levels of homology to the probe. Following hybridization, the filter 
may be washed with 6X SSC, 0.5% SDS at 50T. These conditions are considered to be 
"moderate" comfitions above 25% fbimamide and "low" conditions below 25% formamide. 
Extended cDNAs, nucleic acids homologous to extended cDNAs, or genomic 

25 DNAs which have hybridized to the probe are identified by autoradiography. 

3. Determination of the De gree of Homology Between the Obtained Extended cDNAs 
and the Labeled Probe 

If it is desired to obtain nucleic acids homologous to extended cDNAs, such as allelic 
30 variants thereof or nucleic acids encoding proteins related to the proteins encoded by the 
extended cDNAs, the level of homology between the hybridized nucleic acid and the 
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extended cDNA- or S' EST used as the probe may be fiirther detennined using BLAST2N; 
parameters may be adapted depending on the sequence length and degree of homology 
studied. To detenmine the level of honx>Iogy between the hybridized nucleic add and the 
extended cDNA or STST from which die probe was derived, the nucleodde sequences of die 

5 hybridized nucleic add and the extended cDNA or S'EST from which the probe was derived 
are compared. For example, using the above methods, nucleic adds having at least 95% 
nucleic add homology to the extended cDNA or 5'EST from which the probe was derived 
may be obtained and identified. Similarly, by using progressively less stringent hybridization 
conditions one can obtain and identify nucleic acids having at least 90%, at least 85%, at least 

10 80% or at least 75% homology to the extended cDNA or 5'EST from which the probe was 
derived. 

To detennine whether a done encodes a protein having a given amount of homology 
to the protein encoded by the extended cDNA or 5' EST, the amino add sequence encoded 
by the extended cDNA or 5' EST is compared to the amino add sequence encoded by the 

15 hybridizing nuddc add Honiology is detennined to &dst when an amino add sequence in 
the extended cDNA or 5' EST is closely related to an amino add sequence in the hybridizing 
nucleic add, A sequence is dosdy related when it is identical to that of the extended cDNA 
or 5' EST or when it contains one or more amino acid substitutions therein in which amino 
adds having similar characteristics have been substituted for one another. Using the above 

20 methods and algorithms such as FASTA with parameters depending on the sequence length 
and d^;ree of homology studied, one can obtain nucldc adds encoding proteins having at 
least 95%, at least 90%, at least 85%, at least 80% or at least 75% homology to the proteins 
encoded by die extended cDNA or 5TEST fix>m which the probe was derived. 

25 In addition to the above described methods, other protocols are available to obtain 

extended cDNAs using 5* ESTs as outlined in the following paragraphs. 

Extended cDNAs may be prepared by obtaining mRNA from the tissue, cell, or 
organism of interest using mRNA preparation procedures utilizing polyA selecdon 
procedures or other techniques known to those skilled in the art. A first primer capable of 

30 hybridizing to the polyA tail of the mRNA is hybridized to the mRNA and a reverse 
transcription reaction is performed to generate a first cDNA strand. 
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The first cDNA strand is hybridized to a second primer containing at least 10 
consecutive nucleotides of the sequences of SEQ ID NOs 38-270. Preferably, the primer 
comprises at least 12, 15, or 17 consecutive nucleotides from the sequences of SEQ ID NOs 
3S-270. More preferably, the primer comprises 20 to 30 consecutive nucleotides from the 
S sequences of SEQ ID NOs 38-270. In some embodiments, the primer comprises more than 
30 nucleotides from the sequences of SEQ ID NOs 38-270. If it is desired to obtain extended 
cDNAs containing the fiill protein coding sequence, including the authentic translation 
initiation site, the second primer used contains sequences located upstream of the translation 
initiation site. The second primer is extended to generate a second cDNA strand 

10 complementary to the first cDNA strand. Alternatively, RT-PCR may be performed as 
described above using primers from both ends of the cDNA to be obtiuried. 

Extended cDNAs containing 5' ficigments of the mRNA may be prepared by 
hybridizing an mRNA comprising the sequence of the 5 'EST for which an extended cDNA is 
desired with a primer comprising at least 10 consecutive nucleotides of the sequences 

15 complementary to the 5 'EST and reverse transcribing the hybridized primer to make a first 
cDNA strand from the mRNAs. Preferably, the primer comprises at least 12, 15, or 17 
consecutive nucleotides from the 5'EST. More preferably, the primer comprises 20 to 30 
consecutive nucleotides from the 5*EST. 

Thereafter, a second cDNA strand complementary to the first cDNA strand is 

20 synthesized. The second cDNA strand may be made by hybricfizir^ a primer complementary 
to sequences in the first cDNA strand to the first cDNA strand and extending the primer to 
generate the second cDNA strand. 

The douUe stranded extended cDNAs made using the methods described above are 
isolated and cloned. The extended cDNAs may be cloned into vectors such as plasmids or 

25 viral veaors capable of replicadng in an appropriate host cefl. For example, the host cell may 
be a baaerial, mammalian, avian, or insea ceQ. 

Techniques for isolating mRNA, reverse transcribing a primer hybridized to mRN.A 
to generate a first cDNA strand, extending a primer to make a second cDNA strand 
complementary to the first cDNA strand, isolating the double stranded cDNA and cloning i:x 

30 double stranded cDNA are well known to those skilled in the art and are described in Current 
Protocols in Molecular Biology, John Wiley and Sons, Inc. 1997 and Sambrook et a: 
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Molecular Cloning: A Laboratory Manual, Second Edition, Cold Sprii^ Harbor Laboratory 
Press, 1989, the entire disclosures of which are incorporated herein by reference. 

Alternatively, procedures such as the one described in Exaniple 29 may be used for 
obtaining full length cDNAs or extended cDNAs. In this approach, full length or extended 
5 cDNAs are prepared from mRNA and cloned into double stranded pb^emids as follows. 
The cDNA library in the double stranded phagemids is then rendered single stranded by 
treatment with an endonuclease, such as the Gene II product of the phage Fl, and an 
exonuclease (Chang et aL, Gene 127:9S*8, 1993). A biotinylated oligonucleotide comprising 
the sequence of a 5' EST, or a fragment containing at least 10 nucleoudes thereof, is 

10 hybridized to the single stranded pliageniids. Preferably, the fragment comprises at least 12, 
15, or 17 consecutive nucleotides from the S' EST. More preferably, the fragment comprises 
20-30 consecudve nucleotides from the 5' EST. In some procedures, the fragment may 
comprise more than 30 consecutive micleotides from the 5' EST. 

Hybrids between the biotinylated oligonucleotide and phagemids having inserts 

15 containing the 5' EST sequence are isolated by incubating the hybrids with streptavidin 
coated paramagnetic beads and retrieving the beads with a magnet (Fry et aL, Biotechiiqucs, 
13: 124-13 1, 1992). Thereafter, the resulting phagemids containing the 5' EST sequence are 
released from the beads and converted into double stranded DNA using a primer specific for 
the 5' EST sequence. Alternatively, protocoles such as the Gene Trapper kit (Gibco BRL) 

20 .may be used. The resulting double stranded DNA is transformed into bacteria. Extended 
cDNAs containing the 5' EST sequence are identified by colony PCR or colony hybridization 

Using any of the above described methods in section ni, a plurality of extended 
cDNAs containing full length protein coding sequences or sequences encoding only the 
25 mature protein remaining after the signal peptide is cleaved off may be provided i% 
cDNA libraries for subsequent evaluation of the encoded proteins or use in diagnostic 
assays as described below. 

IV. Expression of Proteins Encoded by Extended cDNAs Isolated Using 5' EST$ 
30 Extended cDNAs containing the full protein coding sequences of their correspond:.-.^ 

mRNAs or portions thereof, such as cDN.As encoding the mature protein, may be used :^ 
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express the encoded secreted proteins or portions thereof as described in Example 30 bdow. 
If desired, the extended cDNAs may contain the sequences encodmg the signal peptide to 
facilitate secretion of the expressed protein. It will be appreciated that a plurality of extended 
cDNAs containing the fiill protein codir^ sequences or portions thereof may be 
5 simultaneousdy cloned into expression vectors to create an expression library for analysis of 
the encoded proteins as described below. 

EX.\MPLE 30 

Expression of the Proteins Encoded bv the Genes Corresponding 

D toS'ESTS or Portions Thereof 

To express the proteins encoded by the genes corresponding to 5' ESTs (or portions 
thereoQ, iiill length cDNAs containing the entire protein coding region or extended cDNAs 
containing sequences adjacent to the S' ESTs (or portions thereoQ are obtained as described 
in Exanq}les 27-29 and cloned into a suitable expression veaor. If desired, the nucleic acids 

5 may contain the sequences encoding the signal pq)tide to ^dlitate secredon of the expressed 
protein. The nucldc acids inserted into the expression vectors may also contain sequences 
upstream of the sequences encoding the signal peptide, such as sequences which regulate, 
expression levels or sequences which confer tissue specific expression. 

The nucleic acid encoding the protein or polypeptide to be expressed is operably 

3 linked to a promoter in an expression veaor using conventional cloning technology. The 
expression vector may be any of the mammaiian, yeast, insect or bacterial expression systems 
known in the art. Conomerdally available vectors and expression systems are available from a 
variety of suppliers including Genetics Institute (Cambridge, MA), Stratagene (La Jolla, 
California), Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If 

5 desired, to enhance expression and facilitate proper protein folding, the codon context and 
codon pairing of the sequence may be optimized for the particular »q)ression organism in 
which the expression veaor is introduced, as explained by Hatfield, et al., U.S. Patent No. 
5,082,767, incorporated herein by this reference. 

The cDNA cloned into the expression vector may encode the entire protein (i.e. the 

0 signal pepdde and the mature protdn), the mature protein (/.e. the protein aeated by cleaving 
the signal peptide off), only the signal peptide or any other ponion thereof 
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The foBowing is provided as one exemplary method to express the proteins encoded 
by the extended cDNAs corresponding to the 5' ESTs or the nucleic acids described above. 
First, the methionine initiation codon for the gene and the poIyA signal of the gene are 
identified. If the nucleic acid encoding the polypeptide to be expressed lacks a methionine to 
5 serve as the initiation site, an initiating methionine can be introduced next to the first codon of 
the nucleic acid using conventional techniques. Similarly, if the extended cDNA lacks a 
polyA signal, this sequence can be added to the construct by, for example, splicing out the 
polyA signal from pSGS (Stratagene) using Bgin and Sail restriction endonuclease enzymes 
and incorporating it into the mammalian expression vector pXTl (Stratagene) pXTl 
10 contains the LTRs and a portion of the gag gene from Moloney Murine Leukemia Vims. 
The portion of the LTRs in the construct allow eflBciera stable transfeciion. The vector 
includes the Herpes Simplex thymidine kinase promoter and the selectable neomycin gene. 
The extended cDNA or portion thereof encoding the polypeptide to be expressed is obtained 
by POl fix)m the bacterial vector using oligonucleotide primers complementary to the 
1 5 extended cDNA or portion thereof and containing restriction endonuclease sequences for Pst 
I incorporated into the 5'primer and Bgin at the 5' end of the corresponding cDNA 3' primer, 
taking care to ensure that the extended cDNA is positioned with the poly A signal. The 
purified fi-agment obtained from the resulting PCR reaction is digested with PstI, blunt ended 
with an exonudease, digested with Bgl n, purified and ligated to pXTl containing a poly A 
signal and prepared for this ligation (blunt/Bglll). 

The ligated product is transfeacd into mouse NM 3T3 cells using Lipofectin (Life 
Technologies, Inc., Grand Island. New York) under conditions outlined in the product 
specification. Positive transfectants are selected after growing the transfected cells in 600 
ligtol G418 (Sigma, St. Louis, Missouri). Preferably the expressed protein is released into 
the culture medium, thereby &cilitating purification. 

Ahematively, the extended cDNAs may be cloned into pED6dpc2 as described 
above. The resulting pED6dpc2 constructs may be transfected into a suitable host cell, such 
as COS 1 cells. Methotrexate resistant cells are selected and expanded. Preferably, the 
protein expressed from the extended cDNA is released into the culture medium thereby 
facilitating purification. 
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Protons in the culture medium are separated by gel elearophoresis If desired, the 
proteins may be ammonium sul&te precipitated or separated based on size or charge prior to 
electrophoresis. 

As a control, the expression vector lacking a cDNA insert is introduced into host cells 
5 or organisms and the proteins in the medium are harvested. The secreted proteins present in 
the medium are deteaed using techniques femiliar to diose skilled in the art such as 
Coomassie blue or silver staining or using antibodies against the proton encoded by the 
extended cDNA 

Antibodies c^Ie of specificaUy recogniai^ the protein of interest taay be generated 
10 using synthetic 15-mer peptides having a sequence encoded by the appropriate 5' EST, 
extended cDNA, or portion thereof The synthetic peptides are injected into mice to generate 
antibody to the polypeptide encoded by the 5' EST. extended cDNA, or portion thereof 

Secreted proteins from die host cdls or oiganisms containing an expresaon veaor 
which contains the extended cDNA ddrived from a 5' EST or a portion thereof are compared 

15 to those from the control cells or organism. The presence ofa band in the medium from the 
cells contaimng the expression vector which is absent in the medium from Ae control ceUs 
indicates that Ae extended cDNA encodes a secreted protein. Generally, the band 
corresponding to the protein encoded by the extended cDNA will have a mobility near that 
expected based on the number of amino acids in the open reading frame of the extended 

20 cDNAl However, the band may have a mobility different tiian that expected as a result of 
modifications such as glycosylation. ubiquitinaiion, or en^matic cleavage. 

Alteniatively, if die protein expressed from the above expression vectors does not 
contain sequences directing its secretion, the proteins expressed from host cdb containing an 
ejqncssion vector with an insert encoding a secreted protein or portion thereof can be 

25 compared to the proteins expressed in control host ceUs containing the expression veaor 
without an insert. The presence of a band in samples from ceils containing the expression 
vector whh an insert which is absent in samples from cdls containing the expression veaor 
without an insert indicates that the desired protein or portion thereof is being expressed. 
Generally, the band will have the niobiliiy expeaed for the secreted protein or portion 

30 thereof However, the band may have a mobility different than that expeoed as a result of 
modifications such as glycosylation, ubiquitinaiion, or enzymatic cleavage. 
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The p-otein encoded by the extended cDNA may be purified wmg standard 
immunochroinaiography tedmiques. In such procedures, a solution containing the secreted 
protein, sudi as the culture medium or a cell extract, is applied to a column having antibodies 
against the seaeted protein attached to the chromatography matrix. The secreted protein is 
5 allowed to bind the immunochromatography column Thereafter, the column is washed to 
remove non-specifically bound proteins. The specifically bound secreted protein is then 
released firom the column and recovered using standard techniques. 

If antibody production is not possible, the extended cDNA sequence or portion 
thereof may be incorporated into expression vectors designed for use in purification schemes 

10 employing chimeric polypeptides. In such strategies, the coding sequence of the extended 
cDNA or portion diereof is msened in frame with the gene encoding the other half of the 
chimera. The other halfofthe chimera may be P-globin or a nickel binding polypeptide, A 
chromatography matrix having antibody to p-globin or nickel attached thereto is then used to 
purify the chimeric protein. Protease deavage sites may be engineered between die p-globin 

1 5 gene or the nickel binding polypeptide and the extended cDNA or portion tiiereof Thus, the 
two polypeptides of the chimera may be separated fi^m one another by protease digestion. 

One usefiil expression vector for generating P-globin chimerics is pSG5 (Stratagene), 
which encodes rabbit p-globin, Intron n of the rabbit p-globin gene fecilitates splicing of the 
expressed transcript, and the polyadenylation signal incorporated into the construct increases 

20 the level of expression. These techniques as described are well known to tiiose skilled in the 
art of molecular biology. Standard methods are published in methods texts such as Davis e( 
aL., {Basic Methods in Molecular Biology, Davis, Dibner, and Battey, ed, Elsevier Press, 
NY, 1986) and many of the methods are available from Strat^ene, Life Technologies, Inc.. 
or Prom^a. Polypeptide may additionally be produced from the construct using in vitro 

25 translation systems such as the In vitro Express^' Translation Kit (Stratagene). 

Following expression and purification of the secreted proteins encoded by the 5' 
ESTs, extended cDNAs, or fragments thereof, the purified proteins may be tested for the 
ability to bind to the surface of various cell types as described in Example 3 1 bdow. It will be 
30 appreciated that a plurality of proteins expressed fi-om these cDNAs may be included in a 
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panel of proteins to be simultaneously evaluated for the aaivities specifically described bdow. 
as well as other biological roles for which assays for determining activity are available. 

EXAMPLE31 

5 Analysis of Secreted t>mt«n3 to Determi ne Whether thev Bind to the Cell Sm fact; 

The proteins encoded by the 5' ESTs, extended cDNAs, or fragments thereof are 
cloned into expression vectors such as those described in Example 30. The proteins are 
purified by size, charge, immunochromatography or other techniques femiliar to those skilled 
in the art. Following purification, the proteins are labeled using techniques known to those 
10 skilled in the art. The labeled proteins are incubated with cells or cell lines derived fi-om a 
variety of organs or tissues to allow the proteins to bind to any receptor present on the cell 
surface. FoUowing the incubation, the cells are washed to remove non-spedfically bound 
protein. The labeled proteins are dctcaed by autoradiography. Alternatively, unlabeled 
proteins may be incubated with the cells and detcaed with antibodies having a detectable 
I S label, such as a fluorescent molecule, attached thereto. 

Specificity of ceD suifiu:e binding may be analyxed by conducting a competition 
analysis in which various amounts of unlabeled protein aie incubated along with the labeled 
proteia The amount of labeled protein bound to the cell surface decreases as the amount of 
competitive unlabeled protein increases. As a control, various amounts of an unlabeled 
20 protein unrelated to the labeled protein is included in some binding reactions. The amount of 
labeled protein bound to Ae cell surface does not decrease in binding reactions containing 
increasing amounts of unrelated unlabeled protein, indicating that the protein encoded by the 
cDNA binds specifically to the cell surface. 

25 As discussed above, secreted proteins have been shown to have a number of 

important physiological eflfects and. consequently, represent a valuable therapeutic resource 
The secreted proteins encoded by the extended cDNAs or portions thereof made accordina : 
Examples 27-29 may be evaluated to determine their physiological activities as descnbed 
below. 



30 
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EXAMPLE 32 

Assaying the Proteins Expressed from Extended cDNAs or Portions Thereof for Cytokine 
Ceil Proliferation or CeK Differentjation Activity 
As discussed aboye, secreted proteins may act as cytokines or may affea cellular 
5 proliferation or differentiatioa Many protein factors discovered to date, including all known 
cytokines, have exhiKted activity in one or more feaor dependent cell proliferation assays, 
and hence the assays serve as a convenient confirmation of cytokine activity. The activity of a 
protein encoded by the extended cDNAs is evidenced by any one of a number of routine 
fector dependent cell proliferation assays for cell lines including, without limitation, 32D, 
10 DA2, DAIG, TIO, B9, B9/1 1, BaF3, MC9/G, MT (preB NO. 2E8, RB5, DAI. 123. Tl 165. 
HT2, CTLL2, TF-1, Mo7c and CMK. The proteins encoded by the above extended cDNAs 
or portions thereof may be evaluated for their ability to r^;ulate T cell or thymocyte 
proliferation in assays such as those described above or in the following references, which arc 
incorporated herein by reference: Current Protocols in Immmology, Ed. by Coligane/oi ., 
15 Greene Publishing Associates and Wiley-Intersdence; Takai et al. J. ImmunoL 137:3494- 
3500, 1986., Bertagnolli et cd,, 1 ImmunoL 145:1706-1712. 1990., Bertagnolli et ai. Cell 
Immunol 133:327-341, 1991; Bertagnolli, et oL, J. ImmunoL 149:3778-3783, 1992, 
BovmanetoL^IImmunoL 152:1756-1761, 1994. 

In addition, numerous assays for cytokine production and/or the proliferation of 
20 ^leen ceDs, lymph node ceUs and thymocytes are known. These include the techniques 
disclosed in Current Protocols in Immunology, supra 1:3.12.1-3.12.14; and Schreiber In 
Current Protocols in Immunology, supra 1 : 6.8. 1-6.8.8. 

The proteins encoded by the cDNAs may also be assayed for the ability to regulate 
the proliferation and differentiation of hematopoietic or lymphopoietic cells. Many assays for 
25 such activity are familiar to those skilled in the art, including the assays in the following 
references, which are incorporated herein by reference: Bottomly et aL, In Current Protocols 
in Immunology., supra. 1 : 6.3.1-6 3.12,. deVries era/., 7. fip. A/(PdL 173:1205-1211, 1991. 
Moreau et al,. Nature 36:690-692, 1988; Greenberger et al., Proc. NatL Acad Set. USA 
80:2931.2938. 1983; Nordan, R., In Current Protocols in Immunology., supra. 1 : 66 I- 
30 6.6.5; Smith « al., Proc, Nati Acad. Sci. USA. 83:1857-1861, 1986; Bennett et aL, m 
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Current Protocols in Immunology supra 1 : 6 15. 1. Ciarietta et aL, In Current Protocob in 
Immunology, supra 1 : 6. 13. 1 . 

The proteins encoded by the cDNAs may also be assayed for their ability to r^aie 
T-cell responses to antigens. Many assays for such activity are fanuliar to those skilled in the 
5 art. including the assays described in the following references, which are incorporated herein 
by reference: Chapter 3 {In Vitro Assays for Mouse Lymphocyte Function). Chapter 6 
(Cytokines and Their Cellular Recq)tors) and Chapter 7, (Immunologic Studies in Humans) 
in Current Protocols in Immunology supra, Weinberger et al,, Proc. NatL AcadL Sci. USA 
77:6091-6095, 1980; Weinberger et aL, Eur J. Immuri 11:405-411, 1981; Takai et aL, J. 
10 ImmunoL 137:3494-3500, 1986; Takai etaLJ, ImmunoL 140:508-512. 1988. 

Those proteins which exhibit cytokine, cell proliferation, or cell difiFerentiation activity 
may then be formulated as pharmaceuticals and used to treat clinical conditions in which 
induction of cell proliferation or differentiation is beneficial. Alternatively, as described in 
1 5 more detail below, genes encoding these proteins or nucleic acids regulating the expression of 
these proteins may be introduced into appropriate host cells to increase or decrease the 
expression of the proteins as desired. 

EXAMPLE 33 

20 Assaying the Proteins Express ed from Extended cDNAs or Porrinns 

Thereof for Activitv as I mmune System Regulators 
The proteins encoded by the cDNAs may also be evahiated for their effects as 
immune regulators. For example, the proteins may be evahiated for thdr activity to influence 
thymocyte or splenocyte cytotoxicity. Numerous assays for such activity are femiliar to those 

25 skilled in the art inchiding the assays described in the following references, which are 
incorporated herein by reference: Chapter 3 {In Vitro Assays for Mouse Lymphocyte 
Function 3.1-3.19) and Chapter 7 (Immunologic studies in Humans) in Current Protocols in 
Immunology , Coligan et aL, Eds. Greene Publishing Associates and Wiley-Interscience; 
Hcmnann et aL, Proc, NatL Acad Sci. USA 78.2488-2492, 1981; Hemnann et aL, J. 

30 ImmunoL 128:1968-1974, 1982; Handa et al., J. ImmtmoL 135:1564-1572, 1985; Takai et 
al., J, ImmunoL 137:3494-3500. 1986; Takai et aL, J, ImmunoL 140:508-512. 1988; 
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Bowman etaL,J. Viroligy 61:1992-1998; Beitagnolli et al.. CelL Immu,K^ 133:327.341. 
1991; Brown etal., J. Immunol. 153:3079-3092. 1994. 

The proteins encoded by the cDNAs may also be evaluated for their eflfeas on T-cell 
dependent inununoglobulin responses and isotype switching. Numerous assays for such 
5 activity are femiliar to those skilled in the art, including the assays disclosed in the following 
references, which are incorporated herein by reference: Maliszewski. J. Immunol. 144:3028- 
3033, \9^MoTAetaI.\nCuirentProtacdsinImmmmlogy, 1 : 3.8.1-3.8.16, Ji(p«i. 

The proteins encoded by the cDNAs may also be evaluated for their eflfect on immune 
effiwor cells, inchiding their tSta on Thl cells and cytotoxic lymphocytes. Numerous assays 
10 for such activity are &niliar to those skilled in the art, including the assays disclosed in the 
foBowing references, which are incoiporated herein by reference: Chapter 3 {In Vitro Assays 
for Mouse Lymphocyte Function 3. 1-3. 19) and Chapter 7 (Immunologic Studies in Humans) 
in Current Proiocob in ImmuntOogy, supra, Takai et al. J. Immunol 137:3494-3500, 1986; 

T9kaietaL,J.ImimmoL 140:508-512, 1988; Beitagnolli e/dL.^ ImmwioL 149:3778-3783, 
15 1992. 

The protdns encoded by the cDNAs may also be evaluated for their eflfect on 
dendritic cell mediated activation of naive T-cells. Numerous assays for such activity are 
femiliar to those skiUed in the art, including the assays disclosed in the foUowing refeiences, 
which are incorporated herein by reference: Guery et al, J. Immunol 134:536-544, 1995; 

20 Inabaetal, J. Exp. Med 173 549-559. 1991; Macatoniae/dl. J. Immunol 154:5071-5079, 
1995; Porgador ei aU. Exp. Med 182:255-260, 1995; Nair et al, J. Virol 67:4062-4069, 
1993; Huang et al. Science 264:961-965. 1994; Macatonia et aU. Exp. Med 169:1255- 
1264, 1989; Bhafdwsg etaL, Journal of Clirucal Investigatitm 94:797-807. 1994; and Liaba 
etal, J. Exp. A/erf 172:63 1-640, 1990. 

25 The proteins encoded by the cDNAs may also be evaluated for their influence on the 

lifetime of lymphocytes. Numerous assays for such activity are ^miliar to those skilled in the 
art, including the assays disclosed in the foflowing references, which are incorporated herein 
by reference: Darzynkicwicz et al.. Cytometry 13:795-808. 1992; Gorczyca et al., Leukemia 
7:659-670, 1993; Gorczyca er o/. , Cancer Res. 53:1945-1951. 1993; ItohetaL, Ce// 66:233- 

30 243, 1991. Zacharchuk. J. Immunol. 145:4037-4045, 1990; Zamai et al. Cytometry 14:891- 
897, 1993; Gorczyca a/., Oncol 1:639-648. 1992. 
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The proteins encoded by the lONAs may also be evaluated for their influence on 
early steps of T<ett conmutment and development Numerous assays for such activity are 
familiar to those skiOed in the art, including without limitation the assays disclosed in the 
foUowing references, wduch are incorporated herein by references: Antica et aL, Bhod 
5 84:111-117, 1994; Fuie et aL, Cell Immunol. 155:111-122, 1994; Galy et ai. Blood 
SS:2TrO-277S, 1995; Told etaL, Proc Nat. AcadSci. USA 88:7548-7551, 1991. 

Those proteins which exhibit activity as immune system regulators activity may then 
be fonnulated as phannaceuticals and used to treat clinical conditions in which regulation of 
immune activity is beneficial. For example, the protein may be usefiil in the treatment of 
10 various immune deficiencies and disorders (inchiding severe combined immunodeficiency). 
e.g., in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as weD 
as efiecting the cytolytic activity of NK cells and other cefl populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as baaerial or fungal 
infections, or may resuh fi-om autoimmune disorders. More specifically, infectious diseases 
15 caused by viral, baaerial, fiingal or other infection may be treatable usii« a piotein encoded 
by extended cDNAs derived from the 5* ESTs of the present invention, including infections 
by mv. hepatitis viruses, herpesviruses, mycobacteria, Leidmiania spp., plamodwm and 
various fimgal infections such as candidiasis. Of course, in this r^aid, a protein encoded by 
extended cDNAs derived fran the 5' ESTs of the present invention may also be usefiil where 
20 a boost to the immune system generally nuy be desirable, /.e., in the treatment of cancer. 

Akeraatively, proteins encoded by extended cDNAs derived from the 5' ESTs of the 
present invontkm may be used in treatment of autoimmune disorders induding, for example, 
connective tissue disease, multiple sclerosis, systemic hipus eiytfiematosus, riieumatoid 
arthritis, autoimmune puhnonary inflammation, Guillaift-Barre syndrome, autoimmune 
25 thyroiditis, insulin dependent diabetes mellftis. myasthenia gravis, grafl-versus-host disease 
and autoimmune inflammatory eye disease. Such a protein encoded by extended cDNAs 
derived fix)m the 5' ESTs of the present invention may also to be usefiil in the treatment of 
allergic reactions and conditions, such as asthma (paiticulariy allergic asthma) or other 
respiratory problems. Other conditions, in which immune suppression is deared (including. 
30 for example, organ transplantation), may also be treatable \istag a protein encoded b>' 
e-vtended cDNAs derived fcom the 5' ESTs of the presem invention. 
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Using the proteins of the invention it may also be possible to regulate immune 
responses either up or down. 

Down r^ation may involve inhibiting or blocking an immune response already in 
progress or may involve preventing the induction of an immune response. The fiinctions of 
5 activated T-ceils may be inhibited by suppressing T cell responses or by inducing specific 
tolerance in T cells, or both. Immunosuppression of T cell responses is generally an active 
non-antigen-specific process which requires continuous exposure of the T cells to the 
suppressive agent Tolerance, which involves inducing non-responsiveness or anergy in T 
ceDs, is distinguishable from immunosuppression in that it is generally antigen-spedfic and 

10 persists after the end of exposure to the tolenzing agent. Operationally, tolerance can be 
demonstrated by the lack of a T ceU response upon reexposure to specific antigen in the 
absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions excluding without 
limitation B fymphocyte antigen fimctions, such as, for example, B7 cosdmulation), e.g., 

1 5 preventing high level lymphokine synthesis by activated T cells, will be useful in situations of 
tissue, skin and organ transplantation and in grafl-versus-host disease (GVHD). For example, 
blockage of T cdl fimction should result in reduced tissue destruction in tissue 
transplantation. Typically, in tissue transplants, rqection of the transplant is initiated through 
its recognition as ftweign by T cells, followed by an immune reaction that destroys the 

20 transplant. The administration of a molecule which inhibits or blodcs interaction of a B7 
lymphocyte antigen with its natural ligand(s) on immune cells (such as a soluble, monomeric 
form of a peptide having B7-2 activity alone or in conjunction with a monomeric fonn of a 
pq>tide having an activity of another B lymphocyte antigen (e.g., B?-!, B7.3) or blocking 
antibody), prior to transplantation, can lead to the binding of the molecule to the natural 

25 ligand(s) on the immune cells without transmitting the correspondmg costimulatory signal. 
Blocking B lymphocyte antigen function in this matter prevents cytokine synthesis by immune 
cells, such as T cells, and thus aas as an immunosuppressant. Moreover, the lack of 
costimulation may also be sufficient to anergize the T ceUs, diereby inducii^ tolerance in a 
subject. Induction of long-term tolerance by B I>-mphocyte antigen-blocking reagents may 

30 avoid the necessity of repeated administration of these blocking reagents. To achieve 
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sufficient tmmunosuppression or tolerance in a subjea. it may also be necessary to block the 
function of a combination of B iymphocyte antigens. 

The efficacy of panicular blocidng reagents in preventing organ transplant rqection or 
GVHD can be assessed using animal modeb that are predictive of efficacy in humans. 
Examples of appropriate systems which can be used inchide alloseneic cardiac grafts in rats 
and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine 
the immunosuppressive efifects of CTLA4Ig fiision proteins in vivo as described in Lenschow 
etal. Science 257:789-792. 1992 and Turka et ai, Proc. Natl. Acad. Sci USA, 89:11102- 
11105, 1992. In addition, murine models of GVHD (see Paul ed.. Fundamental 
Immunology, Raven Press. New York, 1989, pp. 846-847) can be used to determine the 
efltea of blocking B lymphocyte antigen fiuiction in vivo on die development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that 
are reactive against sdf tissue and which promote the production of cytokines and 
15 autoantibodies imrolved in the pathology of the diseases. Preventing the activation of 
autoreactive T cells may reduce or diminate disease symptoms. Administration of reagents 
which block costimulation of T cells by disrupting receptor/ligand interactions of B 
lymphocyte ant^ens can be used to inhibit T ceO activation and prevent production of 
autoantibodies or T cell-derived cytokines which potentially involved in die disease process 
20 Additionally, bbcking reagents m^ induce antigen-spedfic tolerance of autoreactive T ceUs 
which couM lead to fong-tcnn relief fiom the disease. The efficacy of Mocking reagents in 
preventii« or alleviating autoimmune disorders can be determined using a number of weU- 
characterized animal models of human autoimmune diseases. Examples include murine 
experimental autwmmune encq)halitis, systemic lupus erythmatosis in MRL/pr/pr mice or 
25 NZB hybrid mice, murine autoimmuno collagen arthritis, diabetes mellitus in OD mice and 
BE rats, and murine experimental myasthenia gravis (see Paul ed., supra, pp. 840-856). 

Upregulation of an antigen fimction (preferably a B lymphocyte antigen fiinction). as 
a means of up regulating immune responses, may also be useful in Uierapy. Upregulation of 
immune responses may involve eitiier enhancing an existing immune response or eliciting ar. 
30 initial immune response as shown by the following examples. For mstance, enhancing ar 
immune response tiirough stimulating B lymphocyte antigen fiinction may be useful in case^ 
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of viral infection. In addition, systemic viral diseases such as influenza, the common cold, and 
encephalitis might be alleviated by the admirastration of stimulatory form of B lymphoc>'te 
antigens systemically. 

Alternatively, antiviral immune responses may be enhanced in an infected patient by 
5 removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide encoded by extended cDNAs derived from the 5' ESTs of 
the present invention or together with a stimulatory fonm of a soluble peptide encoded by 
extended cDNAs derived from the 5' ESTs of the present invention and reinu-oducing tiie in 
vitro primed T cells into the patient. The infeaed cells would now be capable of defivcring a 

10 costimulatory signal to T cells in vivo, thereby activating the T cells. 

In another application, upregularion or enhancement of antigen fiinction (preferably B 
lymphocyte antigen fiinction) may be useful in the induction of tumor immunity. Tumor cells 
(e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, carcinoma) Iransfeaed with • • 
a nucleic add encoding at least one pq)tide encoded by extended cDN As derived from the 5 * 

15 ESTs of the present invention can be administered to a subject to overcome tumor-spedfic 
tolerance in the subject Ifdesired, the tumor cell can be transfected to esqpress a combination 
of peptides. For example, tumor cells obtained from a patient can be transfected ex vivo with 
an expression vector directing the expression of a peptide having B7.2-like activity alone, or 
in conjunction with a peptide having BT-l-like activity and/or BT-S-like activity. The 

20 ^ transfected tumor cells are returned to the patient to resuh in expression of the peptides on 
the surfece of the transfected cell Alternatively, gene therapy techniques can be used to 
target a tumor cell for transfcction in vivo 

The presence of the peptide encoded by extended cDNAs derived from the 5' ESTs 
of the presem invention having the activity of a B lymphocyte antigen(s) on the sur&ce of the 

25 tumor cell provides the necessary costimulation signal to T cells to induce a T cell mediated 
immune response against the transfected tumor cells. In addition, tumor cells which lack or 
which fail to reexpress sufficient amounts of MHC class I or MHC class H molecules can be 
transfeaed with nucleic adds encoding all or a portion of (e.g., a cytoplasmic-domain 
tmncated portion) of an MHC class I a chain and p2 microglobulin or an MHC class n a 

30 chain and an MHC class U (J chain to thereby express MHC class I or MHC class n protd 
on the cell surface, respectively. Expression of the appropriate MHC class I or class II 
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molecuies in conjunction with a peptide having the activity of a B lymphocyte antigen (e.g.. 
B7-1, B7-2, B7-3) induces a T ceil mediated immune response against die transacted tumor 
cell. Optionally, a gene encoding an antisense construct which blocks ejq)ression of an MHC 
class n associated protein, such as the invariant chain, can also be cotransfected with a DNA 
5 encoding a pqitide having the activity of a B lymphocyte antigen to promote presentation of 
tumor associated antigens and induce tumor spedSc immunity. Thus, the induction of a T 
ceU mediated immune reqxmse in a human subjea m^ be suflBdert to overcome nimor- 
spedfic tolerance in the subject. Aheniatively, as described in more detail below, genes 
encoding these immune system regulator proteins or nucleic adds regulating the expression of 
10 such protons may be introduced into appropriate host ceOs to increase or decrease the 
repression of the proteins as desired. 

EXAMPLE 34 
Assaying the Protons Expressed from Extended cDNA-q 
1^ or Portions Thereof for He matonoiegs Ri py tiatino A^^ti^^y 

The proteins encoded by the extended cDNAs or portions thereof m^ also be 
evahiated for thdr hematopcxesis r^atii^ activity. For exanspk, the effect of the proteins 
on embryonic stem cell differentiation may be evaluated. Numerous assays for such activity 
are ^miliar to those skiOed in the art, induding the assays disclosed in the foOowit^ 
20 references, which are incorporated herein by reference: Johansstm et aL Cell BkO. 15:141- 
151, 1995; Keller et al., MoL Cell. Biol. 13:473-486, 1993; McClanahan e/ cd.. Blood 
81:2903-2915, 1993. 

The proteins encoded by the extended cDNAs or portions thereof may also be 
evaluated for thdr influence on the lifetime of stem cells and stem ceU diflferentiation. 

25 Numerous assays for such aaivity are famiiiar to those skilled in the art, including the assays 
disclosed in the following references, which are incorporated herein by reference: Freshney. 
Methylcellulose Colony Forming Assays, in Culture of Hematopoietic Cells., Freshney, et ai 
Eds. pp. 265-268. Wiley-Liss, Inc., New York, NY. 1994; Hirayama etoL. Proc NatL Acad. 
Sci. USA 89:5907-5911, 1992; McNiece and Briddefl, in Culture of Hematopoietic Cells. 

30 supra, Neben et al., Exp. Hematol. 22:353-359, 1994; Ptoemacher and Cobblestone In 
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Culture of Hematopoietic Cells, supraUll Spooncer et al, in Culttire of Hematopoietic 
Cells, Jipml63-179 and Sutheriand in Culture of Hematopoietic Cells, supra. 139-162. 

Those proteins which exhibit hematopoiesis r^atory activity may then be 
formulated as phaimaceuticals and used to treat clinical conditions in which regulation of 
5 hematopoeisis is beneficial, such as in the treatment of myeloid or lymphoid cell deficiencies 
Involvement in regulating hematopoiesis is indicated even by marginal biological activity in 
support of colony fomiing cells or of fector-dependent cell lines. For example, proteins 
supporting the growth and proliferation of erythroid progenitor cells alone or in combination 
with other cytokines, indicates utility, for example, in treating various anemias or for use in 
10 conjunction with irradiation/chemotherapy to stimulate the production of erythroid precursors 

and/or erythroid cells. Proteins supporting the growth and prolifisration of myeloid cells such ^ 
as granulocytes and monocytes/macrophages (i.e., tracfitional CSF activity) may be useful, for 
example, in conjunction with chemotherapy to prevent or treat consequent myelo- > 
suppression. Proteins supporting the growth and proliferation of megakaryocytes and 
15 consequendy of platelets allows prevention or treatment of various platelet disorders such as 
thrombocytopenia, and generally may be used in place of or complementary to platelet 
transfiisions. Proteins supporting the growth and proliferation of hematopoietic stem cells 
which are capable of maturing to any and all of the above-mentioned hematopoietic cells may 
therefore find therapeutic utility in various stem cell disorders (such as those usually treated 
20 with transplantion, including, without limitation, aplastic anemia and paroxysmal nocturnal 
hemoglobinuriaX as well as in repopulating the stem cell compartment post 
irradiation/chemotherapy, either in vivo or ex vivo (/.e., in coryunction with bone manx)w 
. transplantation or with peripheral progenitor cell transplantation (homologous or 
heterologous)) as normal cells or genetically manipulated for gene therapy. Alternatively, as 
25 described in more detail below, genes encoding hematopoiesis regulating activity proteins or 
nucleic acids regulating die expression of such proteins may be introduced into appropriate 
host cells to increase or decrease the expression of the proteins as desired. 

EXAMPLE 35 

30 Assaving the Proteins Exp ressed from Extended cDNAs or Portions Thereof 

for Regulation of Tissue Growth 
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The proteins encoded by the extended cDNAs or portions thereof may also be 
evaluated for their effect on tissue growth. Numerous assays for such activity are familiar to 
those skilled in the an, including the assays disclosed in International Patent Publication No. 
WO95/16035, International Patent Publication No. WO95/05846 and International Patent 
5 PubUcation No. W09 1/07491. which are incorporated herein by reference. 

Assays for wound healing activity include, without limitation, dwse described in: 
Winter, Epidermal Wound Healing, pps. 71-112. Maibach and Rovee. eds.. Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz. 7. Invest, DermatoL 
71:382-84, 1978, which are incorjXM^ed herein by reference. 
10 Those protons which are involved in the regulation of tissue growdi may then be 

formulated as phannacaiticals and used to treat clinical conditions in which regulation of 
tissue growth is beneficial. For ©cample, a protein encoded by extended cDNAs derived from 
the 5' ESTs of the present invention also may have utility in compositions used for bone, 
cartilage, tendon, ligament and/w n«ve tissue growth or regeneration, as well as for wound 
15 healing and tissue repair and replacement, and in the treatment of bums, incisions and ulcers. 

A protein encoded by extended cDNAs derived from the 5' ESTs of the present 
invention, which induces cartilage and/or bone growth in circumstances where bone is not 
noimally formed, has application in die healing of bone fractures and cartilage damage or 
defects in humans and other animals. Such a preparation employing a proton of the invention 
may have prophylactic use in closed as well as open fracture reduction and also in the 
improved fixation of artificial joints De novo bone synthesis induced by an osteogenic agent 
contributes to the repair of corigcnital, trauma induced, or oncologic resection induced 
craniofridal defects, and also is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal disease. 
25 and in other tooth repair processes. Such agents may provide an environment to attract bone- 
forming cdls, stimulate growth of bone-forming cells or induce differentiation of bone- 
fonning cell progenitors. A protein of the invention may also be usefiil in the treatment of 
osteoporoas or osteoarthritis, such as through stimulation of bone and/or cartilage repair or 
by blocking inflammation or processes of tissue destruction (coUagenase aaiviry, osteoclast 
30 activity, etc.) mediated by inflammatory processes. 
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Another category of tissue regeneration activity that may be attributable to the protein 
encoded by extended cDNAs derived from the 5' ESTs of the present invention is 
tendon/ligament formation. A protein encoded by extended cDNAs derived from the 5* 
ESTs of the present invention, which induces tendon/ligament-like tissue or other tissue 
5 formation in circumstances where such tissue is not normally foniied. has application in the 
healing of tendon or ligament tears, deformities and other tendon or ligament defects in 
humans and other animals. Such a preparation employing a tendon/Bgament-like tissue 
inducing protein may have propl^lacric use in preventing damage to tendon or ligament 
tissue, as wdl as use in the improved fixation of tendon or ligament to bone or other tissues, 

10 and in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue 
formation induced by a composition encoded by extended cDNAs derived from the 5' ESTs 
of the present invention contributes to the repair of tendon or ligaments defeas of congenital, 
traumatic or otiier origin and is also useful in cosmetic plastic surgery for attachment or repair 
of tendons or ligaments. The compositions encoded by extended cDNAs derived from the 5 * 

15 ESTs of the present invention may provide an environment to attract tendon- or ligament- 
forming cells, stimulate growth of tendon- or ligament-forming cells, induce differentiation of 
progenitors of tendon- or ligannent-forming cells, or induce growth of tendon/ligament cells 
or progenitors ex vjv9 for return in vivo to effea tissue repair. The compositions of the 
invention may also be useful in the treatment of tendinitis, carpal tunnel syndrome and other 

20 tendon or ligament defects. The compositions m^ also include an appropriate matrix and/or 
sequestering agent as a carrier as is well known in the art 

The protein encoded by extended cDNAs derived from the 5' ESTs of the present 
invention may also be usefiil for proliferation of neural cdls and for regeneration of nerve and 
brain tissue, /.e., for the treatment of central and peripheral nervous system diseases and 

25 neuropathies, as well as mechanical and traumatic disorders, which involve d^eneration. 
death or trauma to neural cells or nerve tissue. More specifically, a protein may be used in the 
treatment of diseases of the peripheral nervous system, such as peripheral nerve injuncs. 
peripheral neuropathy and localized neuropathies, and central nervous qrstem diseases, such 
as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis. anJ 

30 Shy-Drager syndrome. Further conditions which may be treated in accordance with 'Jw 
present invention include mechanical and traumatic disorders, such as spinal cord disorders 
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head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a protein of the 
invention. 

Proteins of the invention may also be useful to promote better or fester closure of 
5 non-healing vvounds, including without limitation pressure ulcers, ulcers associated with 
vascular insufficiency, surgical and traumatic wounds, and the like. 

It is expected that a protein encoded by extended cDNAs derived from the 5' ESTs 
of the present invention may also exhibit activity for generation or regeneration of other 
tissues, such as organs (including, for example, pancreas, liver, intestine, kidney, skin, 
10 endothelium) muscle (smootii, skeletal or cardiac) and vascular including vascular 
endotheBum) tissue, or for promoting the growth of cells comprising such tissues. Part of the 
desired effects may be by inhibition or modulation of fibrotic scarring to allow nonnal tissue 
to generate. A protein of the invention may also exhibit angiogenic activity. 

A protein encoded by extended cDNAs derived from the 5' ESTs of the present 
1 5 invention may also be usefiil for gut protection or regeneration and treatment of hmg or liver 
fibrosis, reperfusion injury in various tissues, and conditions resulting from systemic cytokine 
damage. 

A protein encoded by extended cONAs derived from the 5' ESTs of the present 
invention m^ also be useful for promoting or inhibiting differentiation of tissues described 
20 above from precursor tissues or cells; or for inhibiting the growth of tissues described above. 

Ahematively, as described in more detail bdow, genes encoding tissue growth 
regulating activity proteins or nucleic acids relating tiie expression of such proteins may be 
introduced into appropriate host cells to bcrease or decrease the expression of the proteins as 
desired. 

25 

EXAMPLE 36 

Assaving the Prot eins Expressed fix)m Extended cDNAs or Portions 
Thereof fo r Regulation of Reproductive Hormones 
The proteins encoded by the extended cDNAs or portions thereof may also be 
30 evaluated for their ability to regulate reproductive hormones, such as follicle stimulating 
hormone. Numerous assays for such activity are 6miliar to tiiose skilled in the art, including 
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the assays disclosed in the following references, which are incorporated herein by reference: 
Vale et aL, Endocrinol 91:562-572, 1972; Ling et oL^ Nature 321:779.782, 1986; Vale et 
aL, Nature 321:776-779, 1986; Mason et ai ,. Nature 318:659-^3, 1985; Forage et aL. 
Proc. Nail. Acad ScL USA 83:3091-3095, 1986. Chapter 6.12 in Curreni Protocols in 
5 Immunology, Coligan et aL Eds, Greene Publishing Associates and Wiley-taersciece ; Taub 
etaL.J. ClinlnvesL 95:1370-1376. 1995; Linde/ail.>lPMS 103:140-146. 1995;Mullere/ 
al.. Eur. J, Immunol 25:1744-1748; Gnibcr et al, J. Immwiol 152:5860-5867. 1994; 
Johnston «r a/., J Immunol 153:1762-1768, 1994. 

Those protdns wKch exhibit activity as reproductive hormones or regulators of cell 

10 movement may then be formulated as pharmaceuticals and used to treat clinical conditions in 
which regulation of reproductive hormones are beneficial. For example, a protein encoded by 
extended cDNAs derived from the 5' ESTs of the present invention may also exhibit activin- 
or inhibin-related activities. Inhibins are charaaerized by their ability to inhibit the release of 
follicle stimulating hormone (FSH), while activins are characterized by their abilhy to 

15 stimulate the release of FStt Thus, a protein encoded by extended cDNAs derived from the 
5' ESTs of the present invention, alone or in heterodimers with a member of the inhibin a 
family, may be useful as a contracep ti ve based on the ability of inhibins to decrease fertility in 
female mammals and decrease spermatogenesis in male mammals. Administration of 
sufBcient amounts of other inluUns can induce inferdlity in these mammals. Ahematively. the 

20 protein of the invention, as a homodimcr or as a heterodimer with other protein subunits of 
the inWbin-B group, nuqr be useful as a fertility inducing therapeutic, based upon the ability of 
activin radecules in sdmuladng FSH release from cdls of the anterior pituitary. See, for 
example, United States Patent 4,798,885, the disclosure of which is incorporated herein by 
reference. A protein of the invention may also be useful for advancement of the onset of 

25 fertility in sexually immature mammals, so as to increase the fifetime reproductive 
performance of domestic animals such as cows, sheep and pigs. 

Alternatively, as described in more detail below, genes encoding reproduaive 
hormone regulating activity proteins or nucleic acids regulating the expression of such 
proteins may be introduced into appropriate host ceUs to increase or decrease the expression 

30 of the proteins as desired. 
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EXAMPLE37 

Assaying the Proteins Exnresse d from Extended ctHUA^ 
Portions The reof for Chemotactic/Chemoldnetic Ac^ ^ 
The proteins encoded by the extended cDNAs or portions thereof may also be 
5 evaluated for chemotactic/chcmokinetic activity. For example, a protein encoded by 
extended cDNAs derived from the 5' ESTs of the present invention may have chemotactic or 
chemoldnetic activity (e.g., act as a chemofcine) for mammalian ceUs, including, for example, 
monocytes, fibroblasts, neutrophils, T-cdls, mast cells, eosinophils, epithelial and/or 
endothelial ceDs. Chemotactic and chemokinetic proteins can be used to mobilize or attract a 
10 desired cell population to a desired site of action. Chemotactic or chemokinetic proteins 
provide particular advantages in treatment of wounds and other trauma to tissues, as well as 
in treatment of localized infections. For example, attraction of lymphocytes, monoqrtes or 
neutrophils to tumors or sites of infection may result in improved immune responses against 
the tumor or infecting agent. 
1 5 A protein or peptide has chemotactic activity for a particular cell population if it can 

stimulate, directly or indirectly, the directed orientation or movement of such cell populatran. 
Preferably, the protein or peptide has the ability to directly stimulate directed movement of 
cdls. Whether a particular protein has chemotactic activity for a population of cells can be 
readily determined by employing such protdn or peptide in any known assay for cell 
20 chemotaxis. 

The activity of a protein of the invention may, among other means, be measured by 
the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 
chemotaxis) consist of assays diat measure the ability of a protein to induce the migration of 

25 cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by Coligan, 
Kiuisbeek, Margulies, Shevach and So-ober, Pub. Greene Publishing Associates and Wiley- 
Intcrscicncc, Chapter 6.12: 6.12.1.6.12.28; Taub etaL, J. Clin. Invest 95:1370-1376. 1995; 

30 Lind et al,, APMIS 103:140-146. 1993; Mueller et al.. Eur. J. Immunol 25:1744-1748; 
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Gniber e/ ai. J. Inummpl. 152:5860-5867. 1994; Johnston et aL J. Immunol , 153:1762- 
1768, 1994. 



EXAMPLE 38 

^ Assaying the Proteins Fxp ressed from Extended cHN/^ ff nr 

Portions Thereof for Re^il: ^tion of Blond rinnino 
The proteins encoded by the extended cDNAs or portions thereof may also be 
evaluated for their eflFects on blood clotting. Numerous assays for such activity are familiar to 
those sidUed in the art, including the assays disclosed in the following references, which ar« 
1 0 incorporated herein by reference: Linet et al., J. Clin. PharmacoL IS: 13 1-140, 1986; Burdick 
et al. Thrombosis Res. 45:413-419, 1987; Huraphicy et aL, Fibrinolysis 5:71-79. 1991, 
Sdiaub. Prostaglandins 35:467-474, 1988. 

Those proteins which are involved in the r^ulation of blood clottii^ may then be 
foimuiated as phannaceuticals and used to treat clinical conditions in which regulation of 
15 blood clotting is beneficial. For example, a protein of the invendon may also exhibit 
hemostatic or Uirombolytic activhy. As a tesuk. such a protein is expected to be use&l m 
treatment of various coagulations disorders Onduding heieditaiy disorders, such as 
hemophilias) or to enhance coagulation and other hemostatic events in treating wounds 
resulting fixsm trauma, surgery or odier causes. A protein of the invention may also be useful 
20 fi)r dissohring or inWbiting formation of thromboses and for treatment and prevention of 
concfitions resdting thMefix)m (such as infarction of cardiac and central nervous system 
vessels (e.g., stroke)). Ahematively, as described in more detail below, genes encoding blood 
clotting activity proteins or nucleic acids regulating die expression of such protans may be 
introduced into appropriate host cells to increase or deaease die expression of die protons as 
25 desired. 



EXAMPLE 39 
Assaying the Proteins Ex pressed from Extended cDN At or 
Pprrions Thereof for Involvement in Receptor/Litand Interacrinns 
The proteins encoded by die extended cDNAs or a portion thereof may also be 
evaluated for their involvement in receptor/ligand interactions. Numerous ass^ for such 
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20 



involvement .are finiiliar to those sidl ed m the art, including the assays disclosed in the 
foUowing references, which are incorporated herein by reference: Chapter 7. 7.28.1-7.28.22 
in Current Protocols in Immunology. CoUgan et aL Eds Greene Publishing Associates and 
Wdey-Interscience; Takai etal., Proc. NatL Acad ScL USA 84:6864-6868, 1987; Bieter e/ 
aL, J. Exp. Med 168:1145-1156. im, Koseastan et aL. J. £j:p. Med 169:149.160. 1989. 
Stoltenborg etaL. J. Immunol Methods 175:59-68. 1994; Stitt etal. Ceff 80:661 -670. 1995; 
Gyurise/ai.Ce// 75.791-803. 1993. 

For example the proteins encoded by extended cDNAs derived from the 5' ESTs of 
the present invention may also demonstrate activity as receptors, receptor figands or inhibitors 
or agonists of receptoryiigand interactions. Examples of such receptors and ligands indude. 
without limitation, cytokine receptors and their ligands. receptor kinases and their ligands. 
receptor phosphatases and their ligands. receptors involved in ccB-ceU interactions and their 
ligands Cmchiding without limitation, cellular adhesion molecules (such as sdectins. integrins 
and their ligands) and receptor/ligand pairs invohred in antigen presentation, antigen 
recognition and development of ceUular and humoral immune responses). Receptors and 
ligands are also usefiil for screening of potential peptide or smaU molecule inhibitors of the 
relevant receptor/Kgaod interaction. A protein encoded by extended cDNAs derived from the 
5* ESTs of the present invention (mcluding, without limitation, fragments of receptors and 
ligands) may themselves be usefiil as inhibitors of receptor/ligand interactions. Alternatively, 
as described in more detail below, genes encoding proteins invoked in receptor/ligand 
interactions or nuddc acids regulating the expression of such proteins may be intrxxhiced into 
qjpropriate host cdb to increase or decrease the expression of the proteins as desired. 



EXAMPLE40 

Assaying the Proteins Expressed from Extended cDNAs or Portions Theienf 
fiy Ami-Inflammatorv Acrivitv 

The proteins encoded by the extended cDNAs or a portion thereof may also be 
evahjated for ami-inflaimnatoiy activity. The anti-inflammatory activity may be achieved by 
providing a stimulus to cdls involved in the inflammatory response, by inliibiting or 
promoting cell-cell interactions (such as, for example, cell adhesion), by inhibiting or 
promoting chemotaxis of cells involved in the inflammatory process, inhibiting or promoting 
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cdl extravasation, or by stinniladng or suppressing production of other factors which more 
directly inhibit or promote an inflanmiatoiy response. Proteins exhibiting such activities can 
be used to treat inflammatory conditions including chronic or acute conditions, including 
without limitation inflammation associated with infection (such as septic shock, sepsis or 
systemic inflammatofy response syndrome), ischemia-reperfusioninury. endotoxin lethality, 
arthritis, complement-mediated hyperacute rgecdon. nephritis, cytokine- or chemokine- 
induced lung injury, inflammatoty bowel disease. Crohn's disease or resulting from over 
production of cytokines such as TNF or E^l. Proteins of the invention may also be use&J to 
treat anaphylaxis and hypersensitivity to an antigenic substance or material. Alternatively, as 
described in more detail bdow, genes encoding anti-inflammatory activity proteins or nucleic 
adds noting the expression of such proteins may be introduced into appropriate host ceUs 
to increase or decrease the oqwession of the proteiis as desired. 



EXAMPLE 41 

AffflV*^ **" Proteins depre ssed from Extended cDNA..; nr 
Portions T hereof for Tumor Inhibhion Activj^ y 
The proteins encoded by the extended cDNAs or a portion thereof may also be 
evaluated toe tumor inhibition activity. In addition to the acrivhies described above for 
immunological treatment or prevention of tumors, a protein of the invention may exhibit other 
20 anti-tumor activities. A protein may inhibit tumor growth direcdy or indirecdy (such as. for 
example, via ADCC). A protein may exhibit hs tumor inhibhory activity by acting on tumor 
tissue or tumor precursor tissue, by inhibiting formation of tissues necessary to support tumor 
growth (such as. for example, by inhibiting angiogenesis). by causing production of other 
feaors. agents or ceU types which inhibit tumor growth, or by suppressing, eliminating or 
25 inhibiting factors, agents or ceU types which promote tumor growth. Alternatively, as 
described in more detail bdow. genes tumor inhibition activity proteins or nucleic adds 
regulating the expresaon of such proteins may be introduced into appropriate host ceils to 
increase or decrease the expression of the proteins as desired 

30 A protein of the invention may also exhibit one or more of the foUowing additional 

activities or eflFects: inhibiting the growth, infection or fimction of, or killing, infectious agents. 
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including.. Without limitation, bacteria, viiuses. fungi and other parasites; eSecting 
(suppressing or enfaandng) bodily charaaeristics. including, without limitation, height, 
weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ or 
body part size or shape (such as, for example, breast augmentation or diminution, change m 
5 bone form or shape); eficting biorfaythms or Orcadian cycles or rhythms; effecting the 
fertility of male or female subjects; efl^ng the metabolism, catabolism, anabolism. 
processing, utilization, storage or eiiroination of dictaiy fet, lipid, protein, carbohydrate, 
vitamins, minerals, cofectors or other nutritional factors or component(s); efiFecting behavioral 
characteristics, inchiding, without limitation, appetite, libido, stress, cognition fmcluding 

10 cognitive disorders), depression (including depressive disorders) and violent behaviors; 
providing analgesic effects or other pain reducing effects; promoting differentiation and 
growth of embryonic stem cells in line^es other than hematopoietic lineages; hormonal or 
endocrine activity; m die case of enzymes, correcting deficiencies of the enzyme and tivating 
deficiency-related diseases; treatment of hyperproliferativB disorders (such as, fbr example. 

1 5 psoriasis); immum^obulin-like activity (such as. for example, the ability to bind amigens or 
complenjent); and the ability to act as an antigen in a vaccine composition to raise an immune 
re^nse against such proton or another material or entity v/tich is ooss-reactive with such 
proteia Ahematively, as described in more detail below, genes encocfii^ proteins involved in 
aiy of the above mentioned activities or nudeic adds r^uiating die expression of such 

20 proteins may be introduced into appropriate host cells to increase or decrease the expression 
of the i»'oteins as desired. 



EXAMPLE 42 

Identificatio n of Proteins which Interact with 
25 PohT)eDrides Encoded bv Extended cDNAs 

Proteins which interart with the polypeptides encoded by cDNAs derived fi-om the 5' 
ESTs or fi^gments thereof such as receptor proteins, may be identified using two hybrid 
systems such as the Matchmaker Two Hybrid System 2 (Catatag No. K1604-I, Ctontech). 
As described in the manual accompanying tiie kit which is incorporated herein by reference. 
30 Uie the cDNAs derived fiom 5' ESTs. or fragments tiiereo£ are inserted into an expression 
veaor such tiiat they are in frame with DNA encoding the DNA binding domain of Uw yeast 
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tnmscriptional activator GAM. cDNAs in a cDNA libraiy which encode proteins which 
might interaa with the polypeptides encoded by the extended cDNAs or portions thereof are 
insened into a second expression vector such that they are in frame with DNA encoding the 
activation domain of GAL4. The two expression plasmids are transformed into yeast and the 
yeast are plated on selection medium which selects for expression of selectable markers on 
each of the expression vectors as weU as GAL4 dependent expression of the fflS3 gene. 
Transfoimants capable of growing on medium lacking histidine are screened for GAL4 
dependent lacZ expression. Those cells which are positive in both the histidine selection and 
the lacZ assay contain plasmids encoding proteins which interact with the polypeptide 
encoded by the extended cDNAs or portiwis thereof 

Alternatively, the system described in Lustig et al., Meihods in Emymotogy 283: 83- 
99. 1997. and in U.S. Patent No. 5.6H150. the disclosure of which is incorporated herein by 
reference, may be used for identifying molecules »*ich inteiact with the polypeptides 
encoded by extended cDNAs. In such systems. //ivi/ro transcription reactions are peifonned 
15 on a pool of vectors containing extended cDNA inserts doned downstream of a promoter 
which drives in vitro tianscriptioa The resulting pools of mRNAs are introduced into 
Xenopuslaevis oocytes. The oocytes are then assayed for a desired activity. 

Aheniatively, the pooled in vitro transcription products produced as described above 
may be translated in vitro. The pooled in vitro translation products can be assayed for a 
20 desired activity or for interaction with a known polypeptide. 

Proteins or other molecules interacting with polypeptides encoded by extended 
cDNAs can be found by a variety of additional techniques. In one method, afBnity 
columns containing the polypeptide encoded by the extended cDNA or a portion thereof 
can be constructed. In some versions, of this method the affinity column contains 
25 chimeric proteins in which the protein encoded by the extended cDNA or a portion 
thereof is fused to ghitathione S-transferase. A mixture of ceUular proteins or pool of 
expressed proteins as described above and is applied to the affinity column. Proteins 
interacting with the polypeptide attached to the column can then be isolated and analyzed 
on 2-D electrophoresis gel as described in Ramunsen et al.. Electrophoresis 18:588-598, 
30 1997. the disclosure of which is incorporated herein by reference. Ahematively, the 
proteins retained on the affinity column can be purified by electrophoresis based methods 
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and sequenced. The same method can be used to isolate antibodies, to screen phage 
display products, or to screen phage display huntan antibodies. 

Proteins interacting with polypeptides encoded by extended cDNAs or portions 
thereof can also be screened by using an Optical Biosensor as described in Edwards and 
Leatheibarrow. Analytical Biochemistry 246:1-6. 1997, the disclosure of which is 
incorporated herein by reference. The main advantage of the metiiod is that it aUows the 
determination of the association rate between the protein and other interacting 
molecules. Thus, it is possible to specifically select interacting molecules with a high or 
low association rate. Typically a target molecule is linked to the sensor surfece (through 
a carboxymethl dextran matrix) and a sample of test molecules is placed in contact with 
the target molecules. The binding of a test molecule to tiie target molecule causes a 
change in the refiactive index and/ or thickness. This change is detected by tiie 
Biosensor provided it occurs in the evanescent field (which extend a few hundred 
nanometers from die sensor surface). In these screening assays, the target molecule can 
15 be one of the polypeptides encoded by extended cDNAs or a portion thereof and the test 
sample can be a collection of proteins extracted from tissues or cells, a pool of expressed 
proteins, combinatorial peptide and/ or chemical libraries, or phage displayed pq)tides. 
The tissues or ceUs from which the test proteins are extracted can originate from any 
^ecies. 

In other metiiods, a target protein is immobilized and die test population is a 
collection of unique polypeptides encoded by the extended cDNAs or portions thereof 

To study the interaction of the proteins encoded by ti»e extended cDNAs or 
portions titereof widi drugs, the microdialysis coupled to HPLC metirod described by 
Wang et al., Chromatograpfaa 44:205-208, 1997 or the affinity capillary electrophoresis 
25 method described by Busch et al., J. Chromaiogr 777:3 1 1-328, 1997. the disclosures of 
which are incorporated herein by reference can be used. 



20 



It win be appreciated by those skilled in the an that tiie proteins expressed from the 
extended cDNAs or portions may be assayed for numerous activities m addition to those 
specifically enumerated above. For example, the expressed proteins may be evaluated for 
applications involving control and regulation of inflammation, tumor proliferation or 
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metastasis, infection, or other clinical conditions. In addition, the proteins expressed 6om the 
extended cDNAs or portions thereof may be use&l as nutritional agents or cosmetic agents. 

nie proteins expressed from the cDNAs or portions thereof may be used to generate 
antibodies capable of specifically binding to the expressed protein or fragments thereof as 
described in Example 40 below. The antibodies may capable of binding a fiiU length protein 
encoded by a cDNA derived from a 5' EST, a mature protein (Le. the protein generated by 
cleavage of the signal peptide) encoded by a cDNA derived from a 5' EST, or a signal 
peptide encoded by a cDNA derived from a 5' EST. Alternatively, the antibodies may be 
capable of binding fragments of at least 10 amino acids of die proteins encoded by the above 
cDNAs. In some embodimems, the amibodies may be capable of binding fiagmems of at 
least 15 amino adds of die proteins encoded by the above cDNAs. In other embodiments, 
the antibodies may be capable of binding fragments of at least 25 amino adds of the proteins 
expressed fix>m the extended cDNAs which comprise at least 25 amino adds of the proteins 
encoded by die above cDNAs. In fiirther embodiments* the antibodies nuy be capable of 
15 bindii«fragn«ntsofat least 40 amino adds ofthe proteins eticoded by the above dJNAs. 

EXAMPLE 43 

Production ofan Antibody tft a HimanPmtein 
Sidwtantially pure protein or polypeptide is isolated from the transfected or 
20 ttansftamed ceOs as described in Example 30. The concentration of protein in the final 
preparation is adjusted, for example, by concentration on an Araicon filter device, to the levd 
of a few Mg/mL Monodonal or polydonal antibody to the protein can then be prepared as 
follows: 

1 Monoclonal Antibody P roduction bv Hvbridoma Fusion 

25 Monodonal antibody to epitopes of any of the peptides identified and isolated as 

described can be prepared from murine hybridomas according to the dassical method ot 
Kohler, and ^GIstein, Nature 256:495. 1975 or derivative methods thereof Briefly, a 
mouse is repetitively inoculated with a few micrograms of the selected protein or 
peptides derived therefrom over a period of a few weeks. The mouse is then sacrificed. 

30 and the antibody producing cells of the spleen isolated. The spleen cdls are fiised bv 
means of polyethylene glycol with mouse myeloma cells, and the excess unfiised cells 
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destroyed by growth of the system on selective media comprising aminopterin (HAT 
media). The successfully fused ceUs are diluted and aUquots of the dilution placed in 
wells of a microliter plate where growth of the culture is continued. Antibody-producing 
clones are identified by detection of antibody in the supernatant fluid of the wells by 
immunoassay procedures, such as ELISA, as originally described by Engvall. Meth. 
Enzymol. 70:419. 1980. the disclosure of which is incorporated herein by reference and 
derivative methods thereof Selected positive clones can be expanded and their 
monoclonal antibody product harvested for use. DetaUed procedures for monoclonal 
antibody production are described in Davis et a!, in Basic Methods in Molecular Biology 
Elsevier. New York. Sealon 21-2, the disclosure of which is incorporated herein by 
reference. 



2. Polyclonal Antibody Production bv ImmuniTatin,, 

Polyclonal andseium containing antibodies to heterogenous epitopes of a single 

15 protein can be prepand by immunizii^ suitable animals with the expressed protein or 
peptides derived thereftora, which can be unmodified or modified to enhance 
immunogenidty. Eflfective polyclonal antibody production is afiFeaed by many factors related 
both to the antigen and the host species. For example, smaU molecules tend to be less 
immunogenic than others and may require the use of carriers and adjuvant. Also, host 

20 animals response vary depending on site of inoculations and doses, with both inadequate or 
excessive doses of antigen resuhing in low titer antisera. Small doses (ng level) of antigen 
administered at multiple intradermal sites spears to be most idiaUft An eflfective 
immunization protocol for rabWts can be found in Vaitukaitis. et al, J. Clin Endocrinol. 
Metab. 33:988-991 (1971) , the disclosure of which is incorporated herem by reference. 

25 Booster iiqections can be given at regular intervals, and antisenon harvested when 

antibody titer thereof, as determined semi-quantitatively, for exan^le, by double 
immunodiflfiision in agar against known concentrations of the antigen, b^ins to fell. See, for 
example, Ouchteriony, et aL, Chap. 19 in: Handbook of ExperimaOal Immunology D. Wier 
(ed) Blackwdl (1973) , the disclosure of whidi is incorporated herein by reference. Plateau 

30 concentration of antibody is usually in the range of 0. 1 to 0.2 mg/ml of serum (about 1 2 mM). 
AflBnity of the antisera for the antigen is determined by preparing competitive binding curves. 
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as described, for example, by Fisher. D.. Chap. 42 in: Manual of Clinical Immunology. 2d 
Ed. (Rose and Friedman, Eds.) Amer. Soc. For Microbiol.. Washington. D.C. (1980) . the 
disclosure of which is incorporated herein by reference.. 

Antibody preparations prepared according to either protocol are useful in 
quantitative immunoassays which determine concentrations of antigen-bearing substances 
in biological samples; they are also used semi-quantitatively or qualitatively to identify 
the presence of anUgen in a biological sample. The antibodies may also be used in 
therapeutic compositions for killing ceUs expressing the protein or reducing the levels of 
the protein in the body. 

V. Use of 5' ESTs or Sequences Obtainable Therefrom or Portions Thereof as 
Reagents 

The 5' ESTs of the present invention (or cDNAs or genomic DNAs obtainable 
therefrom) may be used as reagents in isolation procedures, diagnostic assays, and 
forensic procedures. For example, sequences from the 5' ESTs (or cDNAs or genomic 
DNAs obtainable therefrom) may be detectably labeled and used as probes to isolate 
other sequences capable of hybridizing to them. In addition, sequences from 5' ESTs (or 
cDNAs or genomic DNAs obtainable therefrom) may be used to design PCR primers to 
be used in isolation, diagnostic, or forensic procedures. 

}. Use of S' EgTs or Senuences QbtainahIP T herefrom nr Portions Thereof in Isolarion 
Diagnostic and Forensi^: Pr^»^;,|.« 



2> 



EXAMPLE 44 
Preparation of PCR Prim ers and Amplification of DNA 
The 5' EST sequences (or cDNAs or genomic DNAs obtainable therefrom) may be 
used to prepare PCR primers for a variety of applications, including isolation procedures for 
cloning nucleic adds capable of hybridizing to such sequences, diagnostic techniques and 
forensic techniques. The PCR primers are at least 10 bases, and preferably at least 12. 15. or 
17 bases in length. More preferably, the PCR primers are at least 20-30 bases in length. In 
some embodiments, the PCR primers may be more than 30 bases in length. It is preferred 
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that the primer pairs have approximately the same G/C ratio, so that melting temperatures are 
approximately the same. A variety of PGR techniques are familiar to those sidUed in the art 
For a review of PGR technology, see Molecular Cloning to Genetic Engineering. White Ed. 
in Methods in Molecular Biology 67: Humana Press, Totowa 1997. the disdosure of which 
5 is incorporated herein by reference. In each of these PGR procedures, PGR primers on either 
side of die nucleic add sequences to be anq}lified are added to a suitably prepared nudeic add 
sample along with dNTPs and a thermostable polymerase such as Taq polymerase, Pfu 
polymerase, or Vent polymerase. The nuddc add in the sample is denatured and the PGR 
primers are specifically Iqrbridized to complementary nuddc add sequences in the sample. 
10 The hybridized primers are extended. Thereafter, another cycle of denaturation, 
hybridization, and extension is initiated. The cycles are repeated multiple times to produce an 
amplified Augment containing the nuddc add sequence between the primer sites. 

EXAMPLE 45 

15 Use ofSTSTs as Probes 

Probes derived fi^m 5' ESTs (or cDNAs or genomic DNAs obtainable therefi-om), 
including fiill length cDNAs or genomic sequences, may be labded with detectable labds 
familiar to those sidlled in the ait, induding radioisotopes and non-radioactive labels, to 
provide a detectable probe. The detectable probe may be single stranded or double stranded 

10 and may be made using techniques known in die art, induding in vitro transcription, nick 
translation, or kinase reactions. A nuddc acid sample containing a sequence capable of 
hybridizing to the labded probe is contacted with the labdrf If the nucldc add in the 
sample is double stranded, it may be denatured prior to contacting the probe. In some 
applications* the nuddc add sample may be immobilized on a sur&ce such as a nitrocellulose 

15 or nylon membrane. The nucldc add sample may comprise nucleic adds obtained fix)m a 
variety of sources, including genomic DNA, cDNA libraries, RNA, or tissue samples. 

Procedures used to detect the presence of nucldc acids capable of hybridizing to the 
detectable probe include weD known tediniques such as Southern blotting. Northern blotting, 
dot blotting, colony hybridization, and plaque hybridization. In some applications, the nucleic 

0 add capable of hybridizing to the labeled probe may be doned into vectors such as expression 
vectors, sequencing vectors, or in vitro transcription veaors to feciiitate the characterization 
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and expression of the hybridizing micldc acids m For example, such techniques 

may be used to isolate and clone sequences in a genomic libraiy or cDNA library which are 
capable of hybridizing to the detectable probe as described in Example 30 above. 

PCR primers made as described in Example 44 above may be used in forensic 
5 analyses, such as die DNA fingerprinting techniques described in Examples 46-50 below. 
Such analyses may utilize d^ectable probes or primeis based on the sequences of the the S' 
ESTs or of cDNAs or genomic DNAs isolated using the 5' ESTs. 

EXAi\fPLE46 

•0 Forensic Matching bv DNA Seouendnflr 

In one exemplary method, DNA samples are isolated from forensic specimens of; for 
example, hair, semen, blood or skin cells by conventional methods. A panel of PCR primers 
based on a number of the 5' ESTs of Example 25, or cDNAs or genomic DNAs isolated 
therrfom as described above, is then utilized in accordance with Example 44 to amplify DNA 

15 of approximately 100-200 bases in lengdi from the forensic spedmea Corresponding 
sequences are obtained from a test subject. Each of these identification DNAs is then 
sequenced using standard tedmiques, and a simple database comparison determines the 
differences, if any, between die sequences from the subject and those from the sample 
Statistically significant diflferences between the suspect's DNA sequences and those from the 

:0 . sample condusively prove a lack of identity. This lack of identity can be proven, for example, 
with only one sequence. Identity, on the other hand, should be demonstrated with a large 
number of sequences, all matching. Preferably, a minimum of 50 statistically identical 
sequences of 100 bases in length are used to prove ictentity bctweai the suspect and the 
sample. 

15 

EXAMPLE 47 
Positive I dentification bv DNA Seouendng 
The technique outlined in the previous example may also be used on a larger scale to 
provide a unique fingerprint-type identification of any individual. In this technique, primcn 
0 are prepared fixnn a large number of 5TST sequences from Example 25, or cDNA or 
genomic DNA sequences obtainable therefrom. Prefia^ly, 20 to 50 different primers arc 
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used. These primers are used to obtain a conesponding number of PCR-generated DNA 
segments from tlie individual in question in accordance with Example 44. Each of these 
DNA segments is sequenced, using the methods set forth in Example 46. The database of 
sequences generated through this procedure uniquely identifies die individual from whom the 
sequences were obtained. The same pand of primers may then be used at any later Ume to 
absolutely correlate tissue or other biological q)eciraen with tfutt indivicb^. 



EXAMPLE 48 
Southern Blot Forensic Identification 
10 The procedure of Example 47 is repeated to obtain a panel of at leaa 10 amplified 

sequences fiom an individual and a spedmea Preferably, the pand contains at least 50 
amplified sequences. More prefiaaWy, the pand contains 100 amplified sequences. In some 
en*odiments, the pand contains 200 an^iified sequences. Tliis PCR-generated DNA is then 
digested witb one or a combination o^ preferably, four base specific restriction enzymes. 

15 Such enzymes are commercially available and known to those of sldD in the art. After 
digestion, the resultant gene fiagments are size separated in multiple ckiplicate wells on an 
agarose gd and transfemd to nitrocellulose using Southern blotting techniques weU known 
to those with skin in the art. For a review of Southern blotting see Davis et aL (Basic 
Methods in Molecular Biology. 1986, Elsevier Press, pp 62-65) , the disdosure of which is 

20 incorporated herein by reference. 

A pand of probes based on the sequences of 5' ESTs (or cDNAs or genomic DNAs 
obtainable therefixMnX cw fiagments thereof of at least 10 bases, are radioactivdy or 
coloiimetrically labded using methods known in the art, such as nick translation or end 
labeling, and hybridized to die Southern bbt using techniques known in the art (Davis et at.. 

25 supra). Preferably, the |xx)be c«iq)rises at least 12, 15, or 17 consecutive nudeotides ftom 
the 5' EST (or cDNAs or genomic DNAs obtainable therefrom). More preferably, the probe 
comprises at least 20-30 consecutive nucleotides from the 5' EST (or cDNAs or genomic 
DNAs oteainable therefrom). In some en^iments, the probe comprises more than 30 
nudeotides from the 5' EST (or cDNAs or genomic DN.As obtainable therefrom). 

30 Preferably, at least 5 to 10 of these labded probes are used, and more preferably at 

least about 20 or 30 are used to provide a unique pattern. The resukant bands appearing 
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fiiom the hybridization of a laige sampte of 5' EST (or cDNAs or genomic DNAs obtainable 
therefrom) wiO be a unique identifier. Since the restriction enzyme deavage will be difFerent 
for eveiy individual, the band pattern on the Southern blot will also be unique. Inaeasing the 
number of 5' EST (or cDNAs or genomic DNAs obtainable therefrom) probes wiU provide a 
"5 stausticaBy higher level of confidence in the identification since there wiU be an incieased 
number of sets of bands used for identification. 

EXAMPLE 49 
Dot Blot Identifimtin n Procedure 
10 Anotijer technique for identi^g individuals usii% the 5' EST sequences disclosed 

herein utilizes a dot blot hybridization tedmique. 

Genomic DNA is isolated fix>ra vuda of subject to be identified. Oligonucleotide 
probes of sp^oramatgiy 30 bp in length are synthesized that contend to at least 10, 
preferably 50 sequences fiwn the 5' ESTs or cDNAs or genomic DNAs obtainable 
15 therefirem. The probes are used to hybridize to the genomic DNA through conditions known 
to tiiose in the ait The oligonudeorides are end labeled witii P*' using polynudeoude kinase 
(Phannada). Dot Blots are created by spotting the genomic DNA onto nitroceUulose or die 
like using a vacuum dot blot manifold (BioRad, Richmond California). The nitroceUulose 
filter containing die genomic sequences is baked or UV linked to die filter, prehybridized and 
20 Iq^ridized with labded probe using techniques known in die art Paviset a/., iipra The"P 
labded DNA fragments are sequentially Itybridized with successively stringent conditions to 
detea minimal diflFerences between die 30 bp sequence and die DNA. 
Tetramediytonnionium chloride is usefiil for identifying dones containing smaU numbere of 
nucleotide mismatches (Wood etoL, Proc Natl. ActuL ScL USA 82(6): 1 585- 1588. 1985) 
25 which is hereby incoiporated by reference. A unique pattern of dots distinguishes one 
individual fiiom anotho' imfividuaL 

5' EST sequences (or cDNAs or genomic DNAs obtainable tiierefrom) or 
oligonucleotides containing at least 10 cmsecutive bases from diese sequences can be used as 
probes in the following akemative fii^erprinting technique. Preferably, die probe comprises at 
30 least 12. 15, or 17 consecutive nudeotides fit)m the 5' EST sequences (or cDNAs or 
genomic DNAs obtainable therefrom). More preferably, the probe comprises at least 20-30 
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consecutive nucleotides from the S' EST sequences (or cDNAs or genomic DNAs obtainable 
therefrom). In some embodiments, the probe comprises more than 30 nucleotides from the 5' 
EST sequences (or cDNAs or genomic DNAs obtainable therefrom). 

Preferably, a plurality of probes having sequences from different genes are used in the 
5 alternative fingerprinting technique. Example 50 below provides a representative altemadve 
fingerprinting procedure in which the probes are derived from 5 'EST. 

EXAMPLE 50 
Alternative "Fineeq^rint * Identification Technique 
10 20-mer oligonucleotides are prepared from a large number, e.g. 50, 100, or 200, of 

5'EST using commercially available oligonucleotide services such as Gcnset, Paris, France. 
CeU samples from the test subject are processed for DNA using techniques well known to 
those with skill in the art. The loicleic acid is digested with restriction enzymes such as EcoRI * 
and XbaL Following digestion, samples are applied to wells for electrophoresis. The 
15 procedure, as known in the art, may be nwdified to accommodate polyacrylamide 
electrophoresis, however in this example, samples containing 5 ug of DNA are loaded into 
wells and separated on 0.8% agarose gels. The gels are transferred onto nitrocellulose using 
standard Southern blotting techniques. 

10 ng of each of the oligonucleotides are pooled and end-labeled witii ^^P. The 
20 nitrocellulose is prehybridized with blocking solution and hybridized with the labeled probes. 
Following hybridization and washing, the nitrocellulose fiher is exposed to X-Omat AR X-ray 
film The resulting hybridization pattern will be unique for each individual. 

It is additionally contemplated within this example that the number of probe 
sequences used can be varied for additional accuracy or clarity. 

25 

The proteins encoded by the extended cDNAs may also be used to generate 
antibodies as explained in Examples 30 and 43 in order to identify the tissue type or cell 
species fipom which a sample is derived as described inexampleSI. 



30 



wo 99/06549 



PCT/IB98/01231 



92 

EXAMPLE 51 

Identification of Tissue Types or Cell Species bv Means of 
Labeled Tiss ue Specific Antibodies 
Identification of specific tissues is accomplished by the visualization of tissue specific 
5 antigens by means of antibody preparations according to Examples 30 and 43 which are 
conjugated, directly or indirecdy to a detectable marker. Selected labeled antibody species 
bind to thdr specific antigen binding partner in tissue sections, cell suspensions, or in extracts 
of soluble proteins fi-om a tissue sample to provide a pattern for qualitative or semi-qualitative 
imerpretation. 

10 Antisera for these procedures must have a potency exceeding that of the native 

preparation, and for that reason, antibodies are concentrated to a mg/ml level by isolation 
of the gamma globulin fraction, for example, by ion-exchange chromatography or by 
ammonium sul&te firactionation. Also, to provide the most ^ecific antisera, unwanted 
antibodies, for example to common proteins, must be removed &om the gamma globulin 

1 S fraction, for example by means of insoluble immunoabsorbents, before the antibodies are 
labeled with the maricer. Either monoclonal or heterologous antisera is suitable for either 
procedure. 

A. Immunohistochemical techniques 

Purified, high-titer antibodies, prepared as described above, are conjugated to a 

20 detectable maricer, as described, for example^ by Fudenberg, Chap. 26 in; Basic coid Clinical 
Immunology, 3nl EA Lange, Los Altos, California, 1980, or Rose,, et ai. Chap. 12 in: 
Methods in InwnmxSagnosis^ 2d Ed. John Wiley and Sons, New York (1980), the 
disclosures of which are incorporated herein by reference. 

A fluorescent marker, either fluorescein or rhodamine, is preferred, but antibodies can 

25 also be labeled with an enzyme that supports a color producing reaction with a substrate, such 
as horseradish peroxidase. A4arkers can be added to tissue-bound antibody in a second step, 
as described below. Alternatively, the specific antitissue antibodies can be labeled with ferritin 
or other electron dense particles, and localization of the fenidn coupled antigen-antibody 
complexes achieved by means of an electron microscope. In yet another approach, the 

30 antibodies are radiolabeled, with, for example '^^I, and deteaed by overlaying the antibody 
treated preparation with photographic emulsion 
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Preparations to cany out the procedures can comprise ntonoclonal or polyclonal 
antibodies to a single protein or peptide identified as specific to a tissue type, for example, 
brain tissue, or antibody preparations to several antigenically distina tissue specific antigens 
can be used in panels, independently or in mixtures, as required. 
5 Tissue sections and cell suspensions are prepared for inununohistochemical 

examination according to common histological techniques. Multiple cryostat seaions (about 
4 ^im, unfixed) of the unknown tissue and known control are mounted and each slide 
covered with different dilutions of the antibody preparatioa Sections of known and unknown 
tissues should also be treated with prq)arations to provide a positive control, a negative 
10 control, for example, pre-imnnme sera, and a control for non-specific staining, for example, 

buffer. ^ 

Treated sections are incubated in a humid chamber for 30 min at room temperature, 
rinsed, then washed in buffer for 30-4S mia Excess fluid is blotted away, and the marker 
devdoped. 

15 If the tissue specific antibocty was not labeled in the first incubation, it can be labeled 

at this time in a second antibody-antibody reaction, for example, by adding fluorescein- or 
enzyme-conjugated antibody against the immunoglobulin class of the antiserum-producing 
species, for example, fluorescdn labeled antibody to mouse IgG. Such labeled sera are 
commercially available. 

20 The antigen found in the tissues by the above procedure can be quantified by 

measuring the intensity of color or fluorescence on the tissue section, and calibrating that 

signal using appropriate standards. 

B. Identification of tissue specific soluble proteins 

The visualization of tissue specific proteins and identification of unknown tissues fi-om 
25 that procedure is carried out u^ng the labeled antibody reagems and deteaion strategy as 

described for immunohistochemistry; however the sample is prepared according to an 

electrophoretic technique to distribute the proteins extracted fi-om the tissue in an orderiy 

array on the basis of molecular weight for deteaion. 

A tissue sample is homogenized using a Vinis apparatus; cell suspensions are 
30 disrupted by Dounce homogenization or osmotic lysis, using detergents in either case as 

required to disrupt cell membranes, as is the practice in the an. Insoluble cell components 
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such as nuclei, microsomes, and membrane fragments are removed by ultracemrifiigarion. and 
the soluble protein-containing fraction concentrated if necessary and reserved for analysis. 

A sample of the soluble protein solution is resolved into individual protein species by 
conventional SDS polyaoylamide electrophoreas as described, for example, by Davis, et aL, 
5 Section 19-2 in: Basic Meihext in Mokctilar Biology, Ledcr ed., Elsevier, New York. 1 986, 
the disclosure of which is incorporated herein by reference, using a range of amounts of 
polyaoylamide in a set of gels to resolve the entire molecular weight range of proteins to be 
detected in the sample. A aze marker is run in parallel for purposes of estimating molecular 
weights of the constituent proteins. Sample size for analysis is a convenient volume of from 5 

10 to 55 ^1, and containing from about 1 to 100 |ig protein. An aliquot of each of the resolved 
proteins is transferred by blotting to a nitrocellulose filter paper, a process that maintains the 
pattem of resolution. Multiple copies are prepared The procedure, known as Western Bioi * 
Analysis, is well described in Davis, L. et al., stpra Section 19-3. One set of nitrocellulose 
blots is stained with Coomassie blue dye to visualize the entire set of proteins for comparison 

1 5 with the antibody bound proteins. The remaining nitrocelluiose filters are then incubated whh 
a solution of one or more specific antisera to tissue specific proteins prepared as described in 
Examples 30 and 43. In this procedure, as in procedure A above, appropriate positive and 
negative sample and reagent controls are run. 

In either procedure A or B, a detectable labd can be attached to the primary tissue 

20 antigen-primary antibody conqplex according to various strategies and permutations thereof 
In a stiaightforward approadi, the primary specific antibody can be labeled; alternatively, the 
unlabeled complex can be bound by a labeled secondary anti-IgG antil)ody. In other 
approaches, either the primary or secondary antibody is conjugated to a biodn molecule, 
which can, in a subsequent sXcp, bind an avidin conjugated marker. According to yet another 

25 strategy, enzyme labeled or radioactive protein A, which has the property of binding to any 
IgG, is bound in a final step to either the primary or secondary antibody. 

The visualization of tissue specific antigen binding at levels above those seen :n 
control tissues to one or more tissue specific antibodies, prepared from the gene sequences 
identified Srom extended cDNA sequences, can identify tissues of unknowTi origin, for 

30 example, forensic samples, or difierentiated tumor tissue that has metastasized to foreigr. 
bodily sites. 
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In addition to their applications in forensics and identification, 5' ESTs (or 
cDNAs or genomic DNAs obtainable therefrom) may be mapped to their chromosomal 
locations. Example 52 below describes radiation hybrid (RH) mapping of human 
chromosomal regions using 5 'ESTs. Example 53 below describes a representative 
5 procedure for mapping an 5' EST to its location on a human chromosome. Example 54 
below describes mapping of 5' ESTs on metaphase chromosomes by Fluorescence In 
Situ Hybridization (FISH). Those skilled in the art will appreciate that the method of 
Examples 52-54 may also be used to map cDNAs or genomic DNAs obtainable from the 
5' ESTs to their chromosomal locations. 

10 

2. Use of 5' ESTs or Sequences Obtainable Therefrom or Portions Thereof in 
Chromosome Mapping 

EXAMPLE 52 
Radiation hybrid mapping of 5'ESTs to the human genome 

15 Radiation hybrid (RH) mapping is a somatic cell genetic approach that can be used 

for high resolution mapping of the human genome. In this approach, cell lines containing one 
or more human chromosomes are lethally irradiated, breaking each chromosome into 
fragments whose size depends on the radiation dose. These fragments are rescued by fusion 
with cuhured rodent cells, yielding subclones containing different portions of the human 

20 .genome. This technique is described by Benham et aL, Gerumics 4:509-517, 1989; and Cox 
et aL, Science 250:245-250, 1990, the entire contents of which are hereby incorporated by 
reference. The random and independent nature of the subclones permits efficient mappii^ of 
any human genome marker. Human DNA isolated from a panel of 80-100 cell lines provides 
a mapping reagent for ordering 5'EST In this approach, the frequency of breakage between 

25 markers is used to measure distance, allowing construction of fine resolution maps as has 
been done using conventional ESTs (Schuler et aL, Science 274:540-546, 1996, hereby 
incorporated by reference). 

RH mapping has been used to generate a high-resolution whole genome radiation 
h>trid map of human chromosome 17q22-q25.3 across the genes for growth hormone (GH) 

30 and thymidine kinase (TK) (Foster et ai. Genomics 33:185-192. 1996), the region 
surrounding the Goriin syndrome gene (Obennayr et ai, Eur. J. Hum, Genet, 4:242-245, 
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1996). 60 lod covering the entire short ann of chronwsome 12 (Raeyinaekers et aL, 
Genomics 29:170-178. 1995), the region of human chromosome 22 containing the 
neurofibromatosis type 2 locus (Frazer e/o/.. Genomics 14:574-584. 1992) and 13 loci on the 
long arm of chromosome 5 (Wanington etcd.. Genomics 11:701-708, 1991). 

5 

EXAMPLE S3 

Mapping of S'ESTs to HumanChro mosomes using PCR techniq ues 
5' ESTs (or cDNAs or genomic DNAs obtainable therefrom) may be assigned to 
human chromosomes using PCR based methodologies. In such approaches. oUgonucIeotide 

10 primer pairs are designed from the 5' ESTs (or cDNAs or genomic DNAs obtainable 
therefrom) to minimize the chance of amplifying through an intron. Preferably, die 
oUgonucIeotide primers are 18-23 bp in length and are designed for PCR amplificatioa The 
creation of PCR primen from known sequences is well known to those with skill in the art 
For a review of PCR technology see Eriich in PCR Technology, Principles and Applications 

15 for DMA Amplification, Freeman and Co., New York, 1992, the disclosure of which is 
inc(xporated herein by rderence. 

The primers are used in potymerase chain rcMtions (PCR) to amplify templates from 
total human genomic DNA. PCR conditions are as follows: 60 ng of genomic DNA is used 
as a tem{^ for PCR with 80 ng of each oligonucleotide primer, 0.6 unit of Taq polymciase. 

20 .and 1 pCu of a "P-Iabded deoxycytidine triphosphate. The PCR is performed in a 
microplatc thernwcyder (Tedme) under the following conditions: 30 cycles of 94X, 1 .4 min; 
SS^C, 2 min; and 72°C, 2 min; with a final extension at 72^ for 10 min. The amplified 
products are analyzed on a 6% polyacrylamide sequencing gel and visualized by 
autoradiography. If the length of the resulting PCR produa is identical to the distance 

25 between the ends of the primer sequences in the extended cDNA from which the primere are 
derived, then the PCR reaction is repeated witii DNA templates from two panels of human- 
rodent somatic ceU hybrids. BIOS PCRable DNA (BIOS Corpotaiion) and NIGMS Human- 
Rodent Somatic CeD Hybrid Mapping Panel Number I (NIGMS, Camden, NJ). 

PCR is used to screen a series of somatic cell hybrid cell fines containing defined sets 

30 of human chromosomes for the presence of a given 5' EST (or cDNA or genomic DNA 
obtainable therefix>m). DNA is isolated from the somatic hybrids and used as starting 
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templates'for PCR.reactions using the primer pairs from the 5' EST (or cDNA or genomic 
DNA obtainable therefrom). Only those somatic ceU hybrids with chromosomes containing 
the human gene corresponding to the 5' EST (or cDNA or genomic DNA obtainable 
therefrom) will yield an amplified fragment. The 5' EST (or cDNA or genomic DNA 

5 obtainable therefrom) are assigned to a chromosome by analysis of the segregation pattern of 
PGR products from the somatic hybrid DNA templates. The single human chromosome 
present in aO ceO hybrids that give rise to an amplified firagment is the chromosome containing 
diat 5'EST (or cDNA or genomic DNA obtainable therefrom). For a review of techniques 
and analysis of results from somatic cell gene mapping experiments, see Ledbetter el al., 
10 Genomics 6:475-481. 1990. the disclosure of which is incorporated herein by reference. 

EXAMPLE 54 

Mapping of Extended S' ES Ts to Chromosomes Using nnorescence Tn SSihi 

Hybridization 

15 Euoresccnce in situ hybridization allows the 5'EST (or cDNA or gnomic DNA 

obtainable dierefrom) to be mapped to a particular location on a given chromosome. The 
chTMHosomes to be used fx fluorescence in situ fQfbridizatten techniques may be obtained 
from a variety of sources induding cell cultures, tissues, or whole blood. 

In a preferred embodiment, chromosomal localization of an 5'EST (or cDNA or 

20 -genomic DNA obtainable therefiiom) is obtained by FISH as described by Cherif aL {Proc. 
NatL Acad. Sci. U.SjL, 87:6639-6643. 1990) , the disclosure of which is incoiporated herein 
by reference.. Metaphase chromosomes are prepared fix>m phytohemagghidnin (PHA)- 
stimulated blood ceO donors. PHA-stimulated lymphocytes &om heahhy males are cultured 
for 72 hinRPNa-1640 medhim. For synchronization, mediotrexate (10 pM) is added for 17 

25 h, followed by addition of 5-bromodeoxyuridine (5-BrdU, 0.1 mM) for 6 h. Colcemid (1 
|jg/ml) is added for tiie last 15 min before harvesting the cells Cells are collected, washed in 
RPMI, incubated with a hypotonic solution of KQ (75 mM) at 3T'C for 15 min and fixed in 
three changes of methanoliacetic add (3:1). The cell suspension is dropped onto a glass slide 
and air dried. The 5'EST (or cDNA or genomic DNA obtainable therefrom) is labeled with 

30 biotin-16 dUTP by nick translation according to the manufacturer's instructions (Bethesda 
Research Laboratories, Bethesda, MD), purified using a Sephadec G-50 column (Phannada, 
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Upsala, Sweden) and precipitated. Just prior to hybridization, the DNA pcUet is dissolved in 
hybridization buffer (50% fbnnamide. 2 X SSC. 10% dexttan sulfate, I mg/ml sonicated 
salmon spem DNA, pH 7) and the probe is denatured at 70X for 5-10 min. 

Slides kept at .20»C are treated for I h at 37°C with RNase A (100 ug/ml). rinsed 
5 three times in 2 X SSC and dehydrated in an ethanol series. Chromosome preparations are 
denatured in 70% formamide. 2 X SSC for 2 min at 70°C. then dehydrated at 4»C. The 
slides are treated with proteinase K (10 ng^lOO ml in 20 mM Tris-HCl, 2 mM CaClz) at 37»C 
for 8 min and dehydrated. The hybridization mbcture containing the probe is placed on the 
slide, covered with a coverslip. sealed with rubber cement and incubated overnight in a humid 
10 chamber at 3TC. After hybridization and post-hybridization washes, the biotinylated probe is 
deteaed by avidin-FITC and amplified with additional layers of biotinylated goat anti-avidin 
and avidin-FITC. For chromosomal localization, fluorescent R-bands are obtained as 
previously described (Cherif et aL, supru.). TTie slides are observed under a LHCA 
fluorescence microscope (DMRXA). ChromosonMs are counterstained with propidium 
15 iodide and the fluorescent signal of the probe appears as two symmetrica] yellow-green spots 
on both chromatids of tiie fluorescent R-band chromosome (red). Thus, a particular 5'EST 
(or cDNA or genomic DNA obtainable tiierefrom) may be localized to a particular 
cytogenetic R-band on a given chromosome. 

20 Once die 5*EST (or cDNA or genomic DNA obtainable tiierefrom) have been 

assigned to particular chromosomes using the techniques described in Examples 52-54 above, 
tiwy may be utilized to construa a high resolution map of die chromosomes on which tiiey 
are located or to identify die chromosomes in a sample. 



EXAMPLE 55 
Use of 5'EST to Construct nr Expand Chro mosome Maps 
Chromosome mating invokes assigning a given unique sequence to a particular 
chromosome as described above. Once the unique sequence has been mapped to a given 
chromosome, it is ordered relative to other unique sequences located on die same 
chromosome. One approach to chromosome mapping utilizes a series of yeast artificial 
chromosomes (YACs) bearing several diousand long inserts derived from the chromosomes 
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of the organism from which the extended cDNAs (or genomic DNAs obtainable therefrom) 
are obtained. This ^proach is described in Nagaraja et at.. Genome Research 7:210-222, 
1997, the disclosure of which is incorporated herein by reference. Briefly, in this approach 
each chromosome is broken into overlapping pieces which are inserted into the YAC veaor. 
5 The YAC inserts are screened using PCR or odier methods to determine whether they 
include the 5'EST (or cDNA or genomic DNA obtainable therefrom) whose position is to be 
determined. Once an insert has been found which includes the 5*EST (or cDNA or genomic 
DNA obtainable therefrom), the insert can be analyzed by PCR or other methods to 
determine whether the insert also contains other sequences known to be on the chromosome 

10 or in the region from which the 5'EST (or cDNA or genomic DNA obtainable therefrom) 
was derived. This process can be repeated for each insert in the YAC library to determine the 
location of each of the extended cDNAs (or genomic DNAs obtainable therefrom) relative to *js 
one another and to other known chromosomal markers. In this way. a high resolution map of r 
the distribution of numerous unique markers along each of the organisms chromosomes may 

IS be obtained. 



As described in Example 56 below extended cDNAs (or genomic DNAs obtainable 
therefrom) may also be used to identify genes associated with a particular phenotype, such as 
hereditary disease or drug response. 

20 

3. Use of S^ESTs or Sequenc es Obtained Therefrom or Fragments Thereof in Gene 
Identification 

EXAMPLE 56 

Identification of een es associated with hereditary diseases or drug response 
25 This example ilhistrates an approach useful for die association of 5'ESTs (or cDNA 

or genomic DNA obtainable therefrom) with particular phenotypic characteristics. In ihis 
example, a particular 5'EST (or cDNA or genomic DNA obtainable therefrom) is used as a 
test probe to associate that 5*EST (or cDNA or genomic DNA obtainable therefrom) with a 
particular phenotypic characteristic. 
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5*ESTs (or cDNA or genomic DNA obtainable therefrom) are mapped to a partiailar 
location on a human chromosome using techniques such as those described in Examples 52 
and 53 or other techniques known in the art. A search of Mendelian Inheritance in Man 
(McKusick in Mendelian Inheritmice in Man (available on line through Johns Hopkins 
5 University Welch Medical Libraiy) reveals the region of the human chromosome which 
contains the 5'EST (or cDNA or genomic DNA obtainable therefrom) to be a very gene rich 
region containmg several known genes and several diseases or phenotypes for which genes 
have not been identified. The gene corresponding to this 5'EST (or cDNA or genomic DNA 
obtainable therefrom) thus becomes an immediate candidate for each of these genetic 
10 diseases. 

Cells from patients with these diseases or phenotypes are isolated and expanded ^ 
in culture. PCR primers from the 5'EST (or cDNA or genomic DNA obtainable > 
therefrom) are used to screen genomic DNA, mRNA or cDNA obtained from the ^ 
patients. 5*ESTs (or cDNA or genomic DNA obtainable therefrom) that are not 
15 amplified in the patients can be positively associated with a particular disease by further 
analysis. Alternatively, the PCR analysis may yield fragments of different lengths when 
the samples are derived from an individual having the phenotype associated with the 
disease than when the sample is derived from a healthy individual, indicating that the 
gene containing the 5'EST may be responsible for the genetic disease. 

20 

VI. Use of S'EST (or cDNA or Genomic DNA Obtainable Therefrom) to Construct 
Vectors 

The present 5'ESTs (or cDNA or genomic DNA obtainable therefrom) may also 
be used to construct secretion vectors capable of directing the secretion of the proteins 
25 encoded by genes therein. Such secretion vectors may facilitate the purification or 
enrichment of the proteins encoded by genes inserted therein by reducing the number of 
background proteins from which the desired protein must be purified or enriched. 
Exemplary secretion vectors are described in Example 57 below. 
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1. Coostnict ion of Secretion Vectors 

EXAMPLE 57 
Constmction of Secr etion Vectors 
The secretion vectors include a promoter capable of directing gene expression in the 
host cell, tissue, or organism of interest. Such promoters include the Rxhjs Sarcoma Virus 
promoter, the SV40 promoter, the human cytomegalovirus promoter, and other promoters 
^miliar to those skilled in the art. 

A signal sequence from a 5' EST (or cDNAs or genomic DNAs obtainable 
therefrom) is opcrably linked to die promoter such that the mRNA transcribed fit)m the 
promoter win direa the translation of the signal peptide. The host ceU, tissue, or organism 
may be any ceD, tissue, or organism which recognizes the signal peptide encoded by the signal 
sequence in the 5' EST (or cDNA or genomic DNA obtainable therefrom). Suitable hosts 
include mammalian cells, tissues or organisms, avian cells, tissues, or organisms, insea cdls, 
tissues or organisms, or yeast 

In addition, the secretion vector contains cloning sites for inserting genes encoding 
tiie proteins which are to be secreted. The cloning sites feciKtate the cloning of the insert 
gene in frame with the signal sequence such that a fusion protein in vfiich the signal peptide is 
fused to the protein encoded by tiie inserted gene is expressed from the mRNA tiansciibed 
from ti)e promoter. The signal peptide directs the extraceBular secretion of the fusion proteia 
The secretion vector may be DNA or RNA and may int^rate imo die chromosome 
of die host, be stably maintained as an cxtrachromosomal rcplicon in the host, be an artificial 
chromosome, or be transiemly present in die host. Many nucleic ackl backbones suitable for 
use as secretion vectors are known to tiiose skilled in the art, including retroviral vectors, 
SV40 vectors. Bovine Papilloma Virus vectors, yeast integrating plasmids. yeast cpisomal 
plasmids, yeast artificial diromosomes, human artificial chromosomes, P element vectors, 
baculovinjs vectors, or bacterial plasmids capable of being transiently introduced into the 
host. 

The secretion vector may also contain a polyA signal such diat the polyA signal is 
located downstream of die gene inserted into the secretion veaor. 

After die gene encoding the protein for which secretion is desired is inserted into the 
secretion veaor, die secretion veaor is introduced into the host cell, tissue, or organism using 
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calcium phosphate precipitation. DEAE-Dextran. elearoporation, liposomeHnediated 
transfection, viral panicles or as naked DNA. The protein encoded by the insened gene is 
then purified or enriched fiom the supernatant using conventional techniques such as 
ammonium sul&te precipitation, imnumopredpitation, immunochromatography, size 
exclusion chromatography, ion exchai^e chromatography, and HPLC. Altemativdy, the 
secreted protein may be in a sufficiently enriched or pure state in the supernatant or growth 
media of the host to permit it to be used for its intended purpose widiout further enrichment 

The signal sequences may also be inserted into vectors designed for gene therapy. In 
such vectors, the signal sequence is opcrably linked to a promoter such that mRNA 
U-anscribed from the promoter encodes the signal peptide. A cloning site is located 
downstream of the signal sequence such that a gene encoding a protein whose secretion is 
desired may readily be inserted into the vector and fused to the signal sequence The vector is 
introduced into an appropriate host cell The protein expressed from the promoter is scaeted 
extracellulariy, dierd)y producing a tiaenpeutic effect 

The 5' ESTs may also be used to done sequences located upstream of the 5' 
ESTs which are capable of regulating gene expression, inclucfing promoter sequences, 
enhancer sequences, and other upstream sequences which influence transcription or 
translation levds. Once identified and cloned, these upstream regulatory sequences may 
be used in expression vectors designed to direct the expression of an inserted gene in a 
desired spatial, temporal, developmental, or quantitative feshion. Example 58 describes 
a method for cloning sequences upstream of the extended cDNAs or 5' ESTs. 

2. Identification of Upstream Sequences Wi th Promoting or Regulatory Activities 
25 EXAMPLE 58 

Use of Extended cDNAs or 5' ESTs t o done Upstream Sequences from Genomic TWA 
Sequences derived fiom extended cDNAs or 5' ESTs may be used to isolate the 

promoters of the corresponding genes using chromosome waUdi^ techniques. In one 

chromosome walking technique, which utilizes the GenomeWalker"* kit available from 
30 Clontech, five complete genomic DNA samples are each digested witii a different restriction 
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enzyme whidi has a 6 base recognition site and leaves a bhint end. FoUowing digestion. 
oUgonucIeotide adaptere are Ugated to each end of the resuitir^ genomic DNA fiagments. 

For each of the five genomic DNA libraries, a first PCR reaction is performed 
according to the manu&cturer's instructions (which are incorporated herein by reference) 
5 using an outer adaptor primer provided in the kit and an outer gene specific primer. The gene 
specific primer should be selected to be specific for the extended cDNA or 5' EST of interest 
and should have a mdting temperature, length, and location in the extended cDNA or 5'EST 
which is consistent with hs use in PCR reactions. Each first PCR reaction contains 5 ng of 
genomic DNA, 5 fU of lOX Tth reaction buflFer. 0 2 mM of each dNTP. 0.2 nM each of outer 
10 adaptor primer and outer gene specific pnmer, 1.1 mM of Mg(0Ac)2, and 1 pi of the Tth 
polymerase SOX mix in a total volume of 50 ul. The reaction cycle for the first PCR reaction 
is as foUows: 1 min - 94X / 2 sec - 94»C, 3 min - 72^ (7 cycles) / 2 sec - 94"'C. 3 min - 
6TC (32 cycles) / 5 min - 6TQ. 

The product of the first PCR reaction is diluted and used as a template for a 
15 second PCR reaction according to the manufacturer's instructions using a pair of nested 
primers which are located imemally on the amplicon resulting from the first PCR 
reaction. For example, 5 pi of the reaction product of the first PCR reaction mixture 
may be diluted 180 times. Reactions are made in a 50 pi volume having a composition 
identical to that of the first PCR reaction except the nested primers ire used. The first 
20 .nested primer is specific for the adaptor, and is provided with the'GenomeWalker™ kit 
The second nested prinjer is specific for the particular extended cDNA or 5' EST for 
which the promoter is to be cloned and should have a melting temperature, length, and 
location in the extended cDNA or 5' EST which is consistent with its use in PCR 
reactions. The reaction parameters of the second PCR reaction are as follows; I min - 
25 94'»C / 2 sec - 94°C, 3 min - 72''C (6 cycles) / 2 sec - 94»C. 3 min - 67°C (25 cycles) / 5 
min - ()TC. The product of the second PCR reaction is purified, cloned, and sequenced 
using standard techniques. 

Akemativdy, two or more human genomic DNA libraries can be constructed b> 
usii^ two or more restriction enzymes. The digested genomic DNA is cloned into veaors 
30 which can be converted into single stranded, circular, or finear DNA A biotinylated 
oligonucleotide comprising at least 15 nucleotides from tiie extended cDNA or 5' EST 
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sequence is hybridized to the single stranded DN/V. Hybrids between the biotinylated 
oUgonucleotide and the single stranded DNA containing die extended cDNA or EST 
sequence are isolated as described in Example 29 above. Thereafter, the single suanded 
DNA containing the extended cDNA or EST sequence is released from die beads and 
converted into double stranded DNA using a primer specific for the extended cDNA or 5' 
EST sequence or a primer corresponding to a sequence included in the cloning vector. The 
resulting double stranded DNA is transformed into bacteria. DNAs containing the 5' EST or 
extended cDNA sequences are identified by colony PCR or colony hybridizatioa 



Once the upstream genomic sequences have been cloned and sequenced as described 
above, prospective promoters and transcription start sites within the upstream sequences may 
be identified by comparing the sequences upstream of die extended cDNAs or 5' ESTs with 
databases containing known transoiption start sites, transcription factor binding sites, or 
promoter sequences. 

15 In adcfition, promoters in the upstream sequences mi^ be identified using promoter 

reporter vectors as described in Example. 



EXAMPLE 59 
Idemification o f Promoters in Cloned Unstream Sequences 

20 . The genomic sequences upstream of the extended cDNAs or 5' ESTs are cloned into 
a suitable promoter reporter vector, such as the pSEAP-Basic, pSEAP-Enhancer. pPgal- 
Basic, p^gal-Enfaanoer. or pEGFP-l Promoter Reporter vectors available from Qontech. 
Briefly, each of these pn>moter reporter veaors inchide multiple doning sites positioned 
upstream of a iqwrter gene encoding a readily assayable protein such as secreted alkaline 

25 phoq)hatase, p galactosidase, or green fluorescent proton. The sequences upstream of the 
extended cDNAs or 5' ESTs are inserted into the cloning sites upstream of the reporter gene 
in both orientations and inu-oduced into an appropriate host ccU. The level of reporter prote:n 
is assayed and compared to die level obtained from a vertor which lacb an insert in the 
cloning site. The presence of an elevated expression level in die veaor containing die insen 

30 with respect to die control vector indicates the presence of a pnjmoter in the insen I: 
necessary, die upstream sequences can be cloned into vectors which contain an enhancer for 
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augmenting transcription levels from weak promoter sequences. A significant level of 
expression above that observed with the vector lacicing an insert indicates diat a promoter 
sequence is present in the inserted upstream sequence. 

Appropriate host cells for the promoter reporter veaors may be chosen based on the 
results of the above described determination of expression patterns of the extended cDNAs 
and ESTs. For example, if the expression pattern analysis indicates that the mRNA 
corresponding to a particular extended cDNA or 5' EST is expressed in fibroblasts, the 
promoter reporter vector may be introduced into a human fibroblast cell line. 

Promoter sequences within the upstream genomic DNA may be further defined by 
constraaing nested deletions in the upstream DNA using conventional techniques such as 
Exonuciease DI digestion. The resulting deletion fragments can be inserted into the promoter 
reporter veaor to detennine whether the deletion has reduced or obliterated promoter 
activity. In this way, the boundaries of the promoters may be defined. If desired, potential 
individual regulatory sites within the promoter may be identified using site direaed 
mutagenesis or linker scanning to obliterate potential transcription fector binding sites within 
the promoter individually or in combination. The effects of tiiese mutations on transcription 
levels may be determined by inserting the mutations into the doning sites in the promoter 
reporter veaas. 



20 EXAMPLE 60 

Cloning and IH entification of Promoters 
Using the method described in Example 58 above with 5' ESTs, sequences upstream 
of sev«al genes were obtained. Uang the primer pairs GGG AAG ATG GAjG ATA GTA 
TTG CCT G (SEQ ID NO:29) and CTG CCA TGT ACA TGA TAG AGA GAT TC (SEQ 
25 ED NO:30). the promoter having the internal designation PI3H2 (SEQ ED N0:31) was 
obtained. 

Using the primer pairs GTA CCA GGGG ACT GTG ACC ATT GC (SEQ ID 
NO:32) and CTG TGA CCA TTG CTC CCA AGA GAG (SEQ ID NO:33). the promoter 
having the internal designation P 15B4 (SEQ ID NO:34) was obtamed. 
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Using the primer pairs CTG GGA TOG AAG GCA CGG TA (SEQ E> NO:35) and 
GAG ACC ACA GAG CTA GAG AA (SEQ ID NO:36). the promoter having the internal 
designation P29B6 (SEQ ID NO:37) was obtained. 

Figure 4 provides a schematic description of the promoters isolated md the way they 
5 are assembled with the corresponding 5' tags. The upstream sequences were screcfied for the 
presence of motifs resembling transcription factor binding sites or known transcription start 
sites using the computer program Matlnspector release 2.0, August 1996. 

Table VII describes the transcription ftctor binding sites present in each of these 
promoters. The columns labeled mauice provides the name of the Matlnspector matrix used. 
1 0 The column labeled position provides the 5' position of the promoter site. Numeration of the 
sequence starts from the transcription site as determined by matching the genomic sequence 
with the 5' EST sequence. The column labeled "orientation" indicates the DNA strand on 
which the site is found, with the -t- strand being the coding strand as detennined by matching 
the genomic sequence with the sequence of the 5' EST. The column labeled "score" provides 
1 5 the Matlnspector score found for this site. The column labeled "length" provides the length 
of the site in nucleotides. The column labeled "sequence" provides the sequence of the site 
found. 

Bacterial clones containing plasmids containing the promoter sequences described 
above described above arc presently stored in the inventor's laboratories under the internal 

20 . identification numbers provided above. The inserts may be recovered from the deposited 
materials by growing an aliquot of the appropriate baacrial done in the appropriate medium. 
The plasmid DNA can then be isolated using piasmid isolation procedures ftmiliar to those 
skilled in the art such as alkaline lysis minipreps or large scale alkaline lysis plasmid isolation 
procedures. If desired the plasmid DNA may be further enriched by centrifugation on a 

2S cesium chloride gradient, ^ze exclusion chromatography, or anion exchange chromatography. 
The plasmid DNA obtained u«ng these procedures may then be manipulated using standard 
cloning techniques frmruliar to those skilled in the art. Alternatively, a PGR can be done v^th 
primers designed at both ends of the EST insertion. The PGR product wMch corresponds to 
the S' EST can then be manipulated using standard cloning techniques familiar to those skilled 

30 in the art. 



SUBSTITUTE SHEET (RULE 26) 
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The promoters and other r^ulatory sequences located upstream of the extended - 
cDNAs or 5' ESTs may be used to design ejcpression veaors capable of directing the 
expression of an inserted gene in a desired spatial, temporal, developmental, or quantitative 
manner. A promoter capable of directing the desired spatial, temporal, developmental, and 
5 quantitative patterns may be selected using the results of the expression analysis described in 
Example 26 above. For example, if a promoter which confers a high level of expression in 
muscle is desired, the promoter sequence upstream of an extended cDNA or 5' EST derived 
from an mRNA which is expressed at a high level in muscle, as determined by the method of 
Example 26, may be used in the expression vector. 

10 Preferably, the desired promoter is placed near muhiple restriction sites to facilitate 

the cloning of the desired insert downstream of the promoter, such that the promoter is able 
to drive oqiression of the inserted gene. The promoter may be inserted in conventional 'f 
nucleic add backbones designed for extrachromosomal replication, integration into the host 
chromosomes or transient expression. Suit^le backbones for the present expression vectors 

15 include retroviral backbones, backbones from eukaryotic episomes such as SV40 or Bovine 
Papilloma Vims, backbones from bacterial episomes. or artificial chromosomes. 

Preferably, the expression vectors also include a polyA signal downstream of the 
multiple restriction sites for directing the polyadenylation of mRNA transcribed from the gene 
inserted into the e?q)ression vector 

20 J^oDowii^ the identification ofpromoter sequences using the pro^ 

58-60, proteins vftitdx interact with the promoter may be identified as described \n Example 
61 below. 



EXAMPLE 61 

25 Identification of Proteins Which Interact with Promoter Sequences. Upstream 

Regulatory Sequences> or mRNA 
Sequences within the promoter region which are likely to bind transcription factors 
may be identified by homology to known transcription fector binding sites or through 
conventional mutagenesis or deletion analyses of reporter plasmids containing the promoter 
30 sequence. For example, deletions may be made in a reporter plasmid containii^ the promoter 
sequence of interest operably linked to an assayable reporter gene. The reporter plasmids 
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canying vadous deletions within tlie promoter region are transfected into an appropriate host 
ceU and the effects of the deletions on expression levels is assessed. Transcription feaor 
binding sites within the regions in which deletions reduce expression levels may be fiirther 
localized using site directed mutagenesis, linker scanning analysis, or other techniques iamiliar 
5 to those sldDed in the art 

Nucleic acids encoding proteins which interact with sequences in the promoter 
may be identified using one-hybrid systems such as those described in the manual 
accompanying the Matchmaker One-Hybrid System kit available fiom Qontech (Catalog 
No. K1603-1), the disclosure of whidi is incorporated hernn by reference. Briefly, the 
Matchmaker One-hybrid system is used as foUows. The target sequence for which it is 
desired to identify binding proteins is cloned upstream of a selectable reporter gene and 
integrated into the yeast genome. Preferably, multiple copies of the target sequences are 
inserted into the reporter plasmid in tandem. A libraiy comprised of fiisions between 
cDNAs to be evaluated for the abUity to bind to the promoter and the activation domain 
15 of a yeast transcription fector, such as GAL4. is transfonned into the yeast strain 
containing die integrated reporter sequence. The yeast are plated on selective media to 
select cells expressing the selectable marker linked to the promoter sequence. The 
colonies wWdi grow on the seleaive media contain genes encoding proteins which bind 
the target sequence. The inserts in the genes encoding the fiision proteins are fijrther 
20 characterized by sequencing. In addition, the inserts may be inserted into expression 
vectors or in vitro transcription vectors. Binding of the polypeptides encoded by the 
inserts to the promoter DNA may be confirmed by techniques femiliar to those skilled in 
the art, such as gel shift analysis or DNAse protection analysis. 



25 m Use of 5' ESTs (or cDNAs or Genomic DNAs Obtainable Therefrom) in Gene 
Therapy 

The present invention also comprises the use of 5'ESTs (or cDNA or gnomic DNA 
obtainable therefrtmi) in therapy strategies, including antisense and tr^le helix strategies 
as described in Examples 62 and 63 below. In antisense approaches, nucleic acid sequence* 
30 complementary to an mRNA are hybridized to the mRNA intraceflularly, dierdjy blocking the 
expression of the protein encoded by the niRNA The antisense sequences may prevent i?«ie 
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expression through a variety of mechanisms. For example, the antisense sequences may 
inhibit the ability of ribosomes to translate the mRNA. Altemativdy, the antisense sequences 
may block transport of the mRNA from the nucleus to the cytoplasm, thereby limiting the 
amount of mRNA available for translation. Another mechanism through which antisense 
5 sequences may inhibit gene expression is by interfering vsath mRNA splicing. In yet another 
strategy, the antisense nucleic acid may be incorporated in a ribozyme capable of specifically 
cleaving the target mRNA 

EXAMPLE 62 

10 Preparation a nd Use of Antisense Oligonucleotides 

The antisense nucleic acid molecules to be used in gene therapy may be either DNA 
or RNA sequences. They may comprise a sequence complementary to the sequence of the ?^ 
5 'EST (or cDNA or genomic DNA obtainable therefrom). The antisense nucleic acids ^ 
should have a length and mdting temperature sufSdent to permit formation of an intraceUuIar 

1 3 duplex with suffident stability to inhftit the expression of the mRNA in the duplex. Strategies 
for designing antisense nudeic adds suitable for use in gene therapy are disclosed in Green et 
ai, Ann Rev. Biochem. 55:569-597, 1986; and Izant and Wdntraub, Ce// 36:1007-1015, 
1984, which are hereby incorporated by reference. 

In some strat^es, antisense molecules are obtained from a nucleotide sequence 

20 encoding a protein by reversing the orientation of the coding region with respect lo a 
promoter so as to transcribe the opposite strand from that which is normally transcribed in the 
cell. The antisense molecules may be transcribed using in vitro transcription systems such as 
those which employ T7 or SP6 polymerase to generate the transcript. Another approach 
involves transcription of the antisense nucleic acids in vivo by operably linking DNA 

25 containing the antisense sequence to a promoter in an expression vector. 

Altemativdy, oligonudeotides which are complementary to the strand normally 
transcribed in the cell may be synthesized in vitro. Thus, the antisense nucldc adds rc 
complementary to the corresponding mRNA and are capable of hybridizing to the mRNA :o 
create a duplex In some embodiments, the antisense sequences may contain modified sugx' 

30 phosphate backbones to increase stability and make diem less sensitive to RNase acuviry 
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Examples of modifications suitable for use in antisense strat(^es are described by Rossi et al., 
Pharmacol. Jher. 50(2):245-254, 1991. which is hereby incorporated by reference. 

Various types of antisense oBgonucleotides complementaiy to the sequence of the 
5'EST (or cDNA or genomic DNA obtainable therefix)m) may be used. In one preferred 
5 embodiment, stable and scmi-suibie antisense oligonucleotides described in International 
Application No. PCT WO94/23026. hereby incorporated by reference, are used. In these 
molecules, the 3' end or both the 3' and 5' ends are engaged in intramolecular hydrogen 
bonding between complementaiy base pairs. These molecules are better able to withstand 
exonudease attacks and exhibit increased stabiUty compared to conventional antisense 
10 oligonucleotides. 

In another preferred embodiment, the antisense oligodeoxynucleotides against herpes 
simplex virus types 1 and 2 described in International AppUcation No. WO 95/04141. hereby J 
incorporated by reference, are used. 

In yet another preferred embodiment, the covalently cross-linked antisense 
IS oligonucleotides described in Inteniational Application No. WO 96/31523. hereby 
incorporated by reference, are used. These double- or single-stranded oligonucleotides 
comprise one or more, respectively, inter- or intr»K)Iigonucleotide covalent cross-Unkages. 
wherein the linkage consists of an amide bond between a primary amine group of one strand 
and a carboxyl group of the other strand or of the same strand, respectively, the primary 
amine group being directly substinaed in the 2* position of the strand nucleotide 
monosaccharide ring, and the carboxyl group being carried by an aliphatic spacer group 
substituted on a nucleotide or nucleotide analog of the other strand or the same suand. 
respectively. 

The antisense oligodeoxynucleotides and oligonucleotides disclosed in International 
25 Application No. WO 92/18522, incorporated by reference, may also be used. These 
molecules are stable to degradation and contain at least one transcription control recognition 
sequence which binds to control proteins and are effective as decoys therefore. These 
molecules nay contain "hairpin" structures. "dumbbeU" structures, "modified dumbbell" 
strucmres, "cross-linked" decoy strucmres and "k)op" structures. 
30 In another preferred embodiment, the cyclic double-stranded oligonucleotides 

described in European Patent AppUcation No. 0 572 287 A2. hereby incorporated by 
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references used. These ligated oligonucleotide '^dumbbells" contain the binding site for a 
transcription factor and inhibit expression of the gene under control of the transcription faaor 
by sequestering the fector. 

Use of the closed antisense oligonucleotides disclosed in International Application 
5 No. WO 92/19732, hereby incorporated by reference, is also contemplated. Because these 
molecules have no free ends, they are more resistant to degradation by exonucleases than are 
conventional oligonucleotides. These oligonucleotides may be multifunctional, interacting 
with several regions which are not adjacent to the target mRNA. 

The appropriate level of antisense nucleic acids required to inhibit gene expression 
10 may be determined using in vitro expression analysis. The antisense molecule may be 
introduced into the cells by diflfuaon, injection, infection, transfection or h-region-mediated 
import using procedures known in the art. For example, the antisense nucleic acids can be *x 
introduced into the body as a bare or naked oligonucleotide, oligonudeoride encapsulated in • 
lipid, ofigonudeotide sequence encapsidated by viral protein, or as an oligonudeotide 
1 5 operabty linked to a promoter contained in an expression vector. The expression vector may 
be any of a variety of expression vectors known in the art, induding retroviral or viral vectors, 
vectors capable of extrachromosomal replication, or integrating veaors. The vectors may be 
DNAorRNA. 

The antisense molecules are introduced onto cell samples at a number of different 
20 concentrations preferably between IxlO'^V to lxIO"*M. Once the minimum concentration 
that can adequately control gene expression is identified, the optimized dose is translated into 
a dosage suitable for use in vivo. For example, an inhibiting concentration in culture of Ix 10'^ 
translates into a dose of approximately 0,6 mg/kg bodywdght. Levds of oligonudeotide 
approadang 100 mgA^ bodywdght or higher may be possible after testing the toxicity of the ' 
25 oligonudeotide in laboratory animals. It is additionafly contemplated that cdls from the 
vertebrate are removed, treated with the antisense oligonudeotide, and reintroduced into the 
vertebrate. 

It is further contemplated that the antisense oligonudeotide sequence is incorporated 
into a ribozyme sequence to enable the antisense to specifically bind and cleave its target 
30 mRNA. For technical applications of ribozyme and antisense oligonucleotides see Rossi et 
oL^ supra. 
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In a. preferred application of this invention, the polypeptide encoded by the gene is 
first identified, so that the effectiveness of antisense inhibition on translation can be monitored 
using techniques that include but are not limited to antibody-mediated tests such as RIAs and 
ELISA, functional assays, or radiolabeling. 
5 The 5' ESTs of the present invention (or cDNAs or genomic DNAs obtainable 

therefrom) may also be used in gene therapy approaches based on intracellular triple hda 
formation. Triple helbc oligonucleotides are used to inhibit transcription from a genome. 
They are particulariy useful for studying alterations in cell activity as it is assodated with a 
particular gene. The 5' EST sequences (or cDNAs or genomic DNAs obtainable therefrom) 

10 of the present invention or, more preferably, a portion of those sequences, can be used to 
inhibit gene expresaon in individuals having diseases associated with expression of a 
particular gene. Similarly, a portion of 5' EST sequences (or cDNAs or genomic DNAs 
obtainable therefrom) can be used to study the eflFect of inhibiting transcription of a particular 
gene within a cell. Tradtionally, homopurine sequences were conadered the most usefiil for 

15 triple hefix strategies. However, homopyrimidine sequences can also inhibit gene expression. 
Such homopyrimidine oligonucleotides bind to the major groove at 
homopurineihomopyrimidine sequences. Thus, both types of sequences from the 5'EST or 
from the gene corresponding to the 5'EST are contemplated within the scope of this 
invention. 

20 , 

EXAMPLES! 

Preparation and Use of Triple Hdix Probes 
The sequences of the 5' ESTs (or cDNAs or genomic DNAs obtainable therefrom) 
are scanned to identify 10-mer to 20-mer homopyrimidine or homopurine stretches which 

25 could be used in triple-helix based strategies for inhibiting gene expressioa Following 
identification of candidate homopyrimidine or homopurine stretches, their efficiency in 
inhibiting gene expression is assessed by introducing varying amounts of oligonucleotides 
containing the candidate sequences into tissue culture cells which nonmally express the target 
gene. The oligonucleotides may be prepared on an oligonucleotide synthesizer or they may 

30 be purchased commercially from a company specializing in custom oligonucleotide synthesis, 
such as GENSET, Paris, France. 
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- The oligonucleotides may be introduced into the ceUs using a variety of methods 
known to those skiUed in the an. including but not limited to calcium phosphate precipitation. 
DEAE-Dextran, electroporation, liposome-mediated transfection or native uptake. 

Treated ceils are monitored for altered cell Bmction or reduced gene expression using 
5 techniques such as Nordiem blotting. RNase protection assays, or PCR based strategies to 
monitor the transcription levels of the target gene in cells which have been treated with the 
oligonucleotide. The ceU fiinaions to be monitored are predicted based upon the homologies 
of the taiget gene corresponding to the extended cDNA torn which the oligonucleotide was 
derived with known gene sequences that have been associated with a particular fiinaion. The 
1 0 ceB fimctions can also be predicted based on the presence of abnormal physiologies v«thin 
cells derived from individuals with a particular inherited disease, particularly when the 
extended cDNA is associated with the disease using techniques described in Example 56. 

The oligonucleotides which are eflfcctive in inhibiting gene expression b tissue culture 
cells may then be introduced in vivo using the techmques described above and in Example 62 
15 at a dosage calculated based (Ml the wviiw results, as described in Example 62. 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be 
replaced with alpha anomers to render the oli^nucleotide more reastant to nucleases. 
Further, an intercalating agent such as ethidium bromide, or die Bke. can be attached to the 3' 
end of the alpha oligonucleotide to stabilize the triple helbc. For infoimaiion on the 
20 . generation of oligonucleotides suitable for triple helbc foraiation see GriflSn et al.. Science 
245:967-971, 1989, vAadi is hereby incorporateid by this reference. 



EXAMPLE 64 

Use of cDNAs Obtained Using th«» 5' ESTs to Express an Encoded Protein in a Host 
25 Organism 

The cDNAs obtained as described above using the 5' ESTs of the present invention 
may also be used to express an encoded protein in a host organism to produce a beneficial 
effea. In such procedures, ti» encoded protein may be transiently expressed in the host 
oi^anism or stably expressed in the host organism. The encoded protein may have any of the 
30 activities described above. The encoded protein may be a protein which the host organism 
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lacks or. alternatively, ihe encoded protein may augment the easting levels of the protein in 
the host organism. 

A fiill length extended cDNA encoding the signal peptide and the mature protein, or 
an extended cDNA encoding only the mature protein is introduced into the host organism. 
The extended cDNA may be introduced into the host organism using a variety of techniques 
known to those of skifl in the art For example, the extended cDNA may be injeaed into the 
host organism as naked DNA such that the encoded protein is expressed in the host organism, 
thereby produdi^ a benefidal effect 

Alternatively, the extended cDNA may be doned into an expression vector 
downstream of a promoter which is active in the host organism. The expression vector may 
be any of the expression veaors designed for use in gene therapy, including viral or retroviral 
vectors. The expression vector may be directly introduced into the host organism such that 
the encoded protein is expressed in the host organism to produce a beneficial eflfea In 
anodier ^proach, the expression vector may be introduced into cells in vitro. Cells 
15 containing the expression vector are thereafter selected and introduced into the host 
organism, where they ejqpress the encoded protein to produce a bene6cial efiFea. 



10 



EXAMPLE 65 

Use of Signal Pentides KnrMed bv S* RST s or Sequences obtained Then^ ni 
20 . to Import Prr^ tans Into Cj^k 

The short core Iqrdrophobic r^on (h) of ^nal peptides encoded by the 5'ESTS or 
extended cDNAs derived from SEQ ID NOs: 38-270 may also be used as a carrier to import 
a peptide or a protein of interest, so^alled cargo, into tissue culture cells (Lin et al.. J. Bioi 
Chm.. 270: 14225-14258. 1995; DuetaL.J. Peptide Res.. 51: 235-243. 1998; Rojas el al . 
25 NaOire Biotech., 16: 370-375, 1 998). 

When cell permeable peptides of limited size (approximately up to 25 amino ac:<Js» 
are to be translocated across cell membrane, chemical synthesis may be used in order to aCd 
the h region to either the C-terminus or the N-terminus to the cargo peptide of imerest 
Alternatively, when longer peptides or proteins are to be imported into cells, nucleic acids cin 
30 be genetically engineered, using techniques femiliar to those skilled in the art, in order to liri. 
the extended cDNA sequence encoding the h r^on to the 5* or the 3' end of a DN x 
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sequence-coding for a cargo polype,5tide. Such geneticaJIy engineered nucleic acids are then 
translated either in vitro or in vivo after uansfection into appropriate cefls, using conventional 
techniques to produce the resulting cell permeable polypeptide. Suitable hosts cells are then 
simply incubated with the cell permeable polypeptide which is then translocated across the 
S membrane. 

This method may be applied to stucfy diverse intracdlular Rinctions and cellular 
processes. For mstance, it has been used to probe fiinctionaliy relevant domains of 
intracellular proteins and to examine protein-protein interactions involved in signal 
transduction pathways (Lin et aL, sttpra, Lin et a£. 1 Biol. Chem.. 271: 5305-5308. 1996; 

10 Rojase/a/.. J, BioL Chem,^ 271: 27456-27461. 1996; Liu etoL. Proc. NatL Acad Sci, USA, 
93: 11819-11824, 1996; Rojase/a/.. Biock Biophys. Res. Commun. 234: 675-680. 1997) 

Such techniques may be used in cellular therapy to import proteins producing 
therapeutic effects. For instance, cells isolated from a patient may be treated with imported 
therapeutic proteins and then re-introduced into the host organism. 

1 5 Altemativdy, the h region of signal peptides of the present invention could be used in 

combination with a nuclear localization signal to deliver nucleic acids into ceU nucleus. Such 
oligonucleotides may be antisense oligonucleotides or oligonucleotides designed to form 
triple hdfaces, as described in examples 62 and 63 respectively, in order to inhibit processing 
and/or maturation of a target cellular RNA. 

20 

As discussed above, the cDNAs or portions thereof obtained using the 5' ESTs of the 
present invention can be used for various purposes. The polynucleotides can be used to 
express recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either consiitutively or 

25 at a particular stage of tissue differentiation or development or in disease states); as molecular 
wdght markers on Southern gels; as chromosome markers or tags (when labeled) to identify 
chromosomes or to map related gene positions; to compare witii endogenous DNA 
sequences in patients to identify potential generic disorders; as probes to hybridize and thus 
discover novel, related DNA sequences, as a source of information to derive PGR primers for 

30 genetic fingerprinting; for selecting and making oligomers for attachment to a "gene chip" or 
otiier support, including for examination for expression patterns; to raise anti-protein 
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antibodies using DNA immunization techniques; and as an antigen to raise anti-DNA 
antibodies or elicit another immune response. Where the polynucleotide encodes a protein 
which binds or potcndally binds to another protein (such as, for example, in a receptor-ligand 
interaction), the polynucleotide can also be used in interaction trap assays (such as, for 
5 example, that described in Gyuris et cd.. Cell 75:791-803, 1993, the disclosure of which is 
hereby incorporated by refimnce) to identify polynucleotides encoding the other protein with 
which binding occurs or to identify inhibitors of the binding interaction. 

The proteins or polypeptides provided by the present invention can similarly be used 
in assays to detennine biological activity, including in a panel of muhiple proteins for high- 

10 throughput screening; to raise antibodies or to elicit another immune response; as a reagent 
(including the labeled reagent) in assays designed to quantitatively determine levels of the 
protein (or its receptor) in biological fluids; as markers for ussues in which the corresponding 
protein is preferentially e}q)ressed (either constitutively or at a particular stage of tissue 
dififerentiation or dcvdopment or in a disease state); and, of course, to isolate correlative 

1 S receptors or iigands. Where the protein tnnds or potentially binds to another protein (such as, 
for example, in a receptor-ligand interacdon), the protein can be used to identify the other 
protdn with which binding occurs or to identify inhibitors of the binding interaction. Proteins 
involved in these binding interactions can also be used to screen for peptide or small molecule 
inhibhors or agorusts of the binding interacdon. 

20 Any or all of these research utilities are capable of bdng developed into reagent grade 

or kit format for coramerdalization as research produas. 

Methods for performing the uses listed above are well known to those skilled in the 
art. References diseasing such methods include without limitation Molecular Cloning: A 
Laboratory Manual, 2d ed.. Cold Spring Harbor Laboratory Press, Sambrook,. Fritsch and 

25 Maniatis eds., 1989, and Methods in Enzymology: Guide to Molecular Cloning Techniques, 
Academic Press, Berger and Kimmel eds., 1987. 

Polynucleotides and proteins of the present invention can also be used as nutritional 
sources or supplements. Such uses include without limitation use as a protein or amino add 
supplement, use as a carbon source, use as a nitrogen source and use as a source of 

30 carbohydrate. In such cases the protein or polynucleotide of the invention can be added to 
the feed of a particular organism or can be administered as a separate solid or liquid 
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preparation, such as in the fonm of powder, pills, solutions, suspensions or capsules. In the 
case of nnicroorganisms, the protein or polynucleotide of the invention can be added to the 
medium in or on which the microorganism is cultured. 

Although this invention has been described in terms of certain preferred embodiments, 
5 other embodiments vAich will be apparent to those of ordinary skill in the an in view of the 
disdosure herein are also within the scope of tWs invention. Accordingly, the scope of the 
invention is intended to be defined only by reference to the appended claims. All documents 
cited herein are incorporated herein by reference in their entirety. 
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SCORE 


ro:?R 


new 


13.2 


ID39 


new 


12 


ID40 


new 


11 


ID41 


new 


10.6 


ID42 


new 


10.4 


ID43 


new 


10. 1 


TD44 


new 


9.8 


ID45 


new 


9.8 


ID46 


new 


9.5 


ID47 
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9.4 


ID48 


new 


9.4 


ID49 
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9.2 


IDjO 


new 


9.2 




new 


9.2 


ID52 


new 


9.1 


[Dj3 


new 


9.1 


£054 


new 


9.1 


\D55 


new 


9 




new 


8.9 




new 


8.8 


UJOO 


new 


8.8 


ID59 


new 


8.7 


TTvm 
1X^60 


new 


8.5 


IDol 


new 


8.4 


1U02 


new 


8.1 




new 


8 


ID64 


new 


7.6 




new 


7.5 


lUoo 


new 


7.5 


IVbl 


new 


7:5 


11/05 


new 


7.5 


TTv;o 


new 


7.4 


U//U 


new 


7.3 


1L#71 


new 


7.3 


UJ IZ 


new 


7.1 




new 


6.9 




new 


6.9 




new 


6.9 


ID76 


new 


6.9 


ID77 


new 


6.8 


ID78 


new 


6.8 


ID79 


new 


6.8 


ID80 


new 


6.8 


ID81 


new 


66 


ID82 


new 


6.5 


ID83 


new 


65 


ID84 


new 


6.4 


ID85 


new 


6.4 


ID86 


new 


6.4 


ID87 


new 


6.4 


ID8S 


new 


6.4 



TISSUE 


INTERNAL 


SOURCE 


DESIGNATION 


Testis 


51-39-3-H2-PU 


Testis 


5i.34.3-F8.PU 


Testis 


5H3-1-C5-PU 


Testis 


5l,2-4.C4-PU 


Ovao' 


26-^9. 1-A5-PU 


Testis 


5i.3-3.BlO.PU 


Testis 


51-15.4.AI2-PU 


Testis 


51.14-l<j6.PU 


Spleen 


53.I.4.AI-PU 


Ovary 


26.40.i-All-PU 


Testis 


51.l9-i-A10.PU 


Ovaiy 


26.25-2-D2-PU 


Testis 


51-17.2-C6-PU 


Ovary 


26-W.3.A6.PU 


Ovary 


26-49.1-A9-PU 


Spleen 


2O.7.2-D6.PU 


Testis 


51.2-l-All.PU 


Testis 


5i.43-3.G3.PU 


Ovary 


26.47.2-Bl.PU 


Ovary 


26-ll-l.G8.PU 


Testis 


51.37a.Ell.PU 


Ovary 


26.25-2-Ol.PU 


Testis 


51.13.1.F7-PU 


Spleen 


20.2.1-D7.PU 


Ovary 


26-12.2.B5.PU 


Testis 


51.1.1.G12.PU 


Spleen 


20.8.2.F3.PU 


Spleen 


2O-IO.3.D4.PU 


Spleen 


2O-3.3.G4.PU 


Testis 


51-10-3-B6-PU 


Oi'aiy 


26.27-3-E8.PU 


Testis 


51-44-J-A6-PU 


Testis 


51.7.2-A6.PU 


Ovary 


26-31-1-Dll.PU 


Testis 


5l.28-2-Gl.PU 


Spleen 


20.10-1-B12.PU 


Testis 


51.39.I.A5.PU 


Ovaiy 


26.23.2.A11-PU 


Testis 


51.1.4.C5-PU 


Spleen 


53.2-i-D8.PU 


Spleen 


20-3-2.C11-PU 


Testis 


51-29.4.B4-PU 


Ovary 


26-27-3.EII-PU 


Ovaiy 


26.10-1.H8-PU 


Testis 


51.18.2-G10.PU 


Spleen 


20.2.1-H12-PU 


Testis 


51-10-3.G3.PU 


Uterus 


74.9.4.H2.PU 


Ovar>' 


26.23-3-G2-PU 


Testis 


51.2^.F5-PU 


Uterus 


74.4-3.C4.PU 
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ID89 new 

D90 new 

ID91 new 

ID92 new 

ID93 new 

ID94 new 

ID9i new 

ID96 new 

ID97 new 

ID98 new 

ID99 new 

EDIOO new 

IDiOl new 

ID102 new 

ID 103 new 

ID 104 new 

©105 new 
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ED 107 new 

ID 108 new 

ID 109 new 

EDI 10 new 
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ID113 new 

IDIU new 

IDU5 new 

ID116 new 
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ID 118 new 

ID119 new 

ID 120 new 

ID 121 new 

ID122 new 

ID123 new 

ID 124 new 

ID 125 new 

ID 126 new 

ID127 new 

ID128 new 

ID129 new 

ID130 new 

ID131 new 

ID132 new 

ID 133 new 

ID 134 new 

ID135 new 

ID136 new 

ID137 new 

ID13S new 

ID139 new 

D140 new 

ID141 new 
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TISSUE 




SOURCE 


6.3 


Testis 


6.3 


Spleen 


6.2 


Ovaf>- 


6.2 


Uterus 


6.1 


Ovaiy 


6.1 


Spleen 


6.1 


Ovary 


6 


Spleen 


6 


Ovaiy 


6 


Utenis 


6 


Ovary 


5.9 


Testis 


5.9 


Testis 


5.9 


Testis 


5.9 


Ovary 


5.8 


Testis 


5.8 


Ovary 


5.7 


Ovaiy 


5,7 


Ovary 


5.7 


Ovary 


5.6 


Testis 


5.6 


Testis 


5.5 


Testis 


5.3 


Testis 


5.4 


Testis 


5.4 


Ovary 


5.4 


Testis 


5.4 


Ovary 


5.3 
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5.3 


Testis 


5.2 


Utenis 


5.2 


Spleen 


5.2 


Testis 


5.1 


Spleen 


5.1 


Uterus 


5.1 


Uterus 


5.1 


Ovary 


5 


Ovary 


5 


Ovary 


5 


Testis 
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Ovary 


5 


Utenis 


5 


Testis 


49 


Ovary 


4.9 


Ovary 


4.9 


Ovary 


4.9 


Ovary 


4.8 


Uterus 


4.8 


Ovary 


4.8 


Spleen 


4.8 


Testis 


4.8 


Spleen 


4.7 


Testis 



INTERNAL 
DESIGNATION 

5N31-3-DNPU 

20O.1-H1-PU 

26-4M^3-PU 

74.I1-4-G3-PU 

26-4-i.E9-PU 

20-2-3.a.PU 

26^-l.A9-PU 

20.1- 2-C7-PU 
26.28-4.Hl.PU 
74-8.4.CII-PU 
26.6-3-B9.PU 
51-16.4-BlO.PU 
5M7.3.F9-PU 
5M.2-D10.PU 
26-I0.4«D9-PU 
51-l8-l.C3.PU 
26a5-2-C4-PU 
26.26-3-D7.PU 
26-5-3.A8.PU 
26-47-1 .C6-PU 
5M9-1.F10-PU 
5MI-4-G10.PU 
51-39-3-n-PU 
51.2.1-ElO.PU 
51.26.2-F5-PU 

26.2- 2<J10.PU 
51-35-4.G9.PU 
26.39.I.A6-PU 
26-47.l-E2.PU 

51.26.2- C7-PU 
74-ll-3.F8.PU 
53.3.I.E2.PU 
5I-3I.3.GI2.PU 
20.6-4.G5-PU 
74^-3-Fl.pU 
74.II-I.F8-PU 
26-7-4.B3.PU 
26-5.3-FlO.PU 
26-*9-3-a.PU 

51.29.3- El-PU 
26.26-3.D2.PU 
74-9-4.B4-PU 
51-1-3-E9-PU 
26-5-l<:6-PU 
26-3-l.H5.PU 
26-5l-;-D9.PU 
26-27-3-D7-PU 

74.3- ^.D8.PU 
26-29.1-El-PU 
20-3-1-H3.PU 
5I-3-3.D8-PU 
2O-5.3.D9.PU 
5I^-4.H4.PU 
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ID 142 
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new 


ID 146 
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IDISI 
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ID152 
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ID156 
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IDI57 
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new 


ID159 
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IDI70 
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ID 174 
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ID175 
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ID178 
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ID180 


new 


ID18I 
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ID182 
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Spleen 
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Testis 


4.6 


TesQS 


4.6 


Testis 
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Testis 
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4.5 


Ovary 


4.5 


Splem 


4.5 


Ovar>' 
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Testis 


4.4 


Testis 


4.3 


Testis 
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Ovary 
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Spleeii 


4.3 


Testis 


4.3 


Spleen 


4.3 


Testis 


4.3 


Spleen 


4.2 


Testis 


4.2 


Testis 


4.2 


Ovary 


4.2 


Testis 


4.2 


Spleen 


4.2 


Uterus 


4.2 


Testis 


4.1 


Ovaiy 


4 


Testis 


4 


Testis 


4 


Ovary 


4 


Testis 


4 


Ovarv' 


4 


Ovary 


3.9 


Ovary 


3.9 


Ovary 


3.9 


Ovary 


3.9 
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3.8 


Testis 


3.8 


Spleen 


3.8 


Testis 


3.8 


Ovar}' 


3.8 


Ovary 


3.7 


Testis 


3.7 


Ovziy 


3.7 
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3.7 


Utenis 


3.7 


Testis 


3.7 


Uterus 


3.7 


Spleen 


3:7 


Testis 



INTERNAL 
DESIGNATION 

5l-5^-G12-PU 

20-9-2.F7.PU 

53.3-2- A lO-PU 

26.30-4^1.PU 

51-29-3.H6.PU 

5I-5-3-G2-PU 

51-11-3-D5.PU 

51-7-1.E7-PU 

51-27.1.G12-PU 

74-^-l-F5.PU 

26-24.1.F8-PU 

20-7-3.F6.PU 

26-1.2-A8-PU 

51.1-3.H9.PU 

51-27-1.E8-PU 

51^^.B2.PU 

26^-1-a-PU 

20-J.2.E2.PU 

51.19^.F5-PU 

20.8-4.D7-PU 

51.24-1-81 l-PU 

20-6-2<}l0.PU 

51.6- J4^-PU 
51.36.2-A9-PU 

26.7- 3-HlO-PU 
51.I-3.D9.PU 
20-2.1.81 l-PU 
74-6^-A5-PU 
51-14.3.n-PU 
26.33-3.E2-PU 
51-26-4.CT-PU 
51-25-3-F3-PU 
26-8-3-D5.PU 
5M2.3.F9.PU 
26-27-l.C5^>U 

26.1- 1-G2-PU 
26-8-3.H3-PU 
26-40-2.A9.PU 
26-24->.A3.PU 
74-5.3.BI2-PU 
5i.37.2-Gi2.PU 
20-8.2.E7-PU 
51-2-1-H9-PU 
26-46-4.DI2-PU 
26.40-l-A12.PU 
51-3-4-E2.PU 
26-;7-3-G12-PU 

26.2- ^-E12-PU . 
74-M.D6.PU 
51-36-4-A3-PU 

74.11.1. B8.PU 

20.10.2. G2.PU 
51-37a.D6-PU 



wo 99/06549 



PCT/IB98/0123I 



122 



SEQ.ID 



CATEGORY 







CD 195 


iiew 


ID196 


new- 
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ID2U 


ext-est-not-vrt 


ID212 
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ID213 
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ID214 


cxt-est-not-vrt 


ID215 


c.\t-esi-noi-vrt 


ID2I6 


cxt-ea-not-vrt 


ID217 


ext-est-not-vn 


ID218 


ext-esc-not-vit 


ID219 


est-not«<xt 


ID220 


cst-not-cxt 


ID221 


est-not-ext 


ID222 


est-not-«ct 


ID223 


cst-not-cxt 


ID224 


est*not«ext . 


ID225 


est-not-ext 


ID226 


est-noi-ext 


ID227 


est-not-ext 


ID228 


est-not-ext 


ID229- 


est-not-ext 


ID230 


est-not-ext 


ID231 


est-not-ext 


D232 


est-not-ext 


ID233 


est-not-ext 


ID234 


est-not-ext 


ID235 


est-not-ext 


ID236 


est-not-ext 


ID237 


est-not-ext 


ID238 


est-not-ext 


ID239 


est-not-ext 


ID240 


est-not-ext 


ID241 


cst-not-cxt 


ID242 


cst-not-cxt 


ID243 


est-not-ext 


ID244 


est-not-ext 


ID245 


cst-not-cxt 


ID246 


est-not-ext 


ID247 


est-not-ext 



VONHEUNE 
SCORE 



TISSUE 



INTERNAL 
DESrCNATrciM 



J.O 


Ovary 


26-27-4-G9-PU 


j.O 


Testis 


51-2.3-A6-PU 


J.O 


Ovary 


26-24-2.A3-PU 


J.O 


Uterus 


74-3-3-F6-PU 




Spleen 


20'lO-2-B2-PU 


J.J 


Testis 


51-13.2-G2.PU 


1 < 
Xj 


Testis 


51-17-4.A4-PU 


J.J 


Spleen 


20-10.3-E5-PU 


1 € 

J.J 


Testis 


51-30-I.B6-PU 


J.J 


Ovary 


26-40-2-GI2-PU 


J.J 


Ovary 


26-9-3-G4-PU 


15 7 


Testis 


51.18-4.A4.PU 


7.4 


Ovary 


26-44-I.B5-PU 


7.3 


Testis 


5|.20-1.A2-PU 


7.1 


Ovary 


26.2-1-A12-PU 


6.7 


lesus 


31-2-1-A7-PU 


5.6 


Spleen 


33-1-1-ClO-PU 


5.6 


wicms 


^ t l/x t DiA nr r 

74-10-I-BlO-PU 


5.3 


I csus 


31-31-4-Al-PU 


4.4 


I csus 


31-23-1-A2-PU 


4.1 


1CSU5 


^i-35-2-F8-PU 


3.9 


ICSMIS 


£10^ m BIT 


3 9 


Testis 


^l-34-2-H6-PU 


3-5 


UlCTuS 


74-7-2-rll-PU 


lOJ 


lesus 


CI 1 o 1 nr r 

5l-lS-l-G7.PU 


9.5 


ICSuS 


CI 1 ^1 BIT 

51-Z>-l-Ul-PU 


8,3 


Ovaxy 


26-5-1-D12-PU 


83 

O.J 


Tesos 


51-41-l.FlO-PU 


82 


v/vaiy 


26-12-1-A2.PU 


8.1 


opieen 


D3-3.3-B8-PU 


o 


Testis 


5M-4-A12-PU 


7.8 


lesus 


CI 1 0 1 V\t r 

^l-l6-l-H7-PU 


7.6 


Cf«|M»n 




7.5 




< 1 ^9 7 CIA Df 1 


7.1 






7.1 




^ 1 * i?Q or f 


6.5 




4U-o-l-Ui i-ru 


6.4 


\yyaiy 


^iC 1 All nr t 


6.4 


Testis 


J l-y-J-AlZ-rU 


6,2 


Ovary 


26.8-3-F5.PU 


6.1 


Ovary 


26-27.2.A12.PU 


6 


Uterus 


74.II.3.H4.PU 


5.8 


Ovary 


26-51-2-GIO-PU 


5.8 


Testis 


51-23.1-G2-PU 


5.7 


Uterus 


74-1-2-Hl-PU 


5.7 


Testis 


51-9-1-E7-PU 


5.3 


Testis 


51.1-4.E9.PU 


4.8 


Testis 


51-6-4.G2-PU 


4.8 


Spleen 


20-2-l-C:5-PU 


4.7 


Testis 


51-23-1-H2-PU 


4.6 


Testis 


5M9.3-H6.PU 


4.6 


Testis 


5I.IO.3.DII.PU 


4.6 


Testis 


5I.20.2.G7.PU 
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ID258 
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ID259 


est-not-ext 


[D260 
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iU2oI 


est-not-ext 


ID262 


est-not-cxt 


ID263 


est-not-cxt 


ID264 


est-not-ext 


ID265 


est-noi-ext 


ID266 


est-not-ext 


ID267 


est-not-ext 


ID268 


est-not-ext 


ID269 


ext-vit-not-genomic 


ID270 


ext-vR-not-genomic 
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VONHEUNE 


TISSUE 


INTERNAL 


SCORE 


SOURCE 


DESICNATrOM 


4.6 


Ov*arv 




4.5 


Ovsiy 


•>i£- 1 1 ^ m f 


4.4 


Ovavy 


l^^l^.Ul Pfl 


4.4 


S{^een 




4.3 


Testis 




4.3 


Spleen 




4.2 


Ovarv 


^v- J- 1 -r o-ru 


4 1 


Testis 




4.1 


Testis 




4.1 


Tesds 




3.9 


Testis 




3.8 


Spleen 


20-2-1-B4-PU 


3.8 


Spleen 


20-7-2-A6-PU 


3.8 


Ovary 


26-27-t.D3-PU 


3.8 


Ovary 


26-5-4-F9-PU 


3.8 


Uterus 


74-3.1-B9-PU 


3.7 


Spleen 


20.8-4-An-PU 


3.6 


Testis 


5M5-4-GI0-PU 


3.6 


Testis 


51-2.1-AlO-PU 


3.5 


Spleen 


53-1.1-AIO-PU 


3.3 


Testis 


51-15-4.H10-PU 


8.1 


Ovaiy 


26.36-1-DU-PU 


4 


Testis 


5I.39-2.D9-PU 



wo 99/06549 



124 



PCT/IB98/0mi 



TABLE in 



SEQ. ID 

SIGNAL PFPTTDE 



ID38 MGEASPPAPARRHLLVLLLLLSTLVIPS^A 

ID39 MAPQTLLPVLVLCVLLLQAQO 

n>40 MWTLKSSLVLLLCLTCSYA 

ID4I MLPLLLLPLLWGGSLQ 

ID42 METGALRRPQLLPLLLLLCCPSODOC 

ID43 MERLVLTLCTLPLAVA 

ID44 MMLPQWLLLLFLLFFFLFLLTRG 

ID45 MKPVLPLQXLWFCLALQLVPG 

VD46 MFRQRQETAQRSTQSCRCPRDGLFFSLFSAPLASA 

ID47 MGSSACEIAVGTKRLLLALPLALVLG 

ID48 MSNQRLPUFSLLFICFFCESFC 

ID49 MLWFLSFLLALLSLNC 

ID50 MLXISLEIXSFICCVrVLISLSWT 

CDS I MVFRNCILFILTFFSHTFC 

ID52 N4LAACPLSPGCQS 

ID53 MAWSPLFLTLrmcrVSWA - 

ID54 MLKSVLVSLCSWSPPLTS • 

ID55 VrrSKXILVSFILAALSLSTrFS - 

ID56 MKSLSLXLAVXLGLATAVSA 

ID57 MWAMESGHLLWALLFMQSLWP 

ID58 MAQTWAXLLVMGSLPSASWS 

tt)59 MKCCalJ^YLL^rLLYVWPVINA 

ID60 MRKPAAGFLPSLLKVLLLPLAPAAA 

ID61 MRQSLLFLTSWPFVLA 

ID62 MELSQMSELMGLSVLLGLLALMATA 

{?^'f'-SCI^«^WSSVSSADSracSASAAGTRha.PFQFCIJlQAI^^ 

MENQLWHKTVRCCNQYQESPHDAEDILLLLLGLrVLVNl 

ID68 • MLSXKTTLLTLSPNSVCC "-ui.iyi.vrw 

ID69 MEGPROWLVLCVLAISLA 

ID70 MKSLLFTLAVFMLLAQLVSG 

ID71 MLiOJLLFSLLISIVC 

ID72 MTPWCLACLGRRPLASLQWSLTLAWC 

ID73 MTMRHNWTPDLSPLWVLLLCAHWTL 

ID74 NTTGNNRDLFCATLSCMPATS 

1>73 MTMRHNWTTOLSPLWVLLLCAHWTL 

ID76 MKPLLETLYLLGMLVPGGLG 

J^g-^^^VCLWLTSAAMSRGDNCTOLLALGIPSn-QAWGLXVVLLGAVTLLF^ 

ID78 MHRQISFIJJJ^KPRIaWFCQ^^^VNU«CRYLLS^^LT^lVlC 

ro79 MKQWLCWVLRLECRQGLGVGEPRGLRLCLGALSAXTFVSFLHA 
fj^°LCFWVIWGEMSFSSHVSRCTWYQWDLSLLMLTLI^ 

i^SJ^™SE^''^^^^"^^^'^^'CP^GKVNLNmVKVASARfLHQVriFPaYSV>; 
ANYCFLNFDVPQYAWEraSFAAPSn-IVIUVmTSACSA 

ID82 MSTSSSSSWDhlLLESLSLSTVWNWIQA 

ID83 MVFATIGFSLKSGLALGSAGLLWCLA 



ID64 

ID65 
ID66 
ID67 



ID8I 
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ID99 
ID 100 



SEQ. ID 

NO SIGNAL PEPTTDF 

©84 NfVLLLSGSVSVGVC 

ID85 MCSQKRAVSNQGLMDLGLCXLCXVXNVFA 

ID86 MLIVLTLHSPSCDT 

ID89 MLIPVFSFSLQLLSSSCT 

n>92 iMHVECFYFLSTALGSQA 
ID93 MSPGSALALLWSLPASDLG 
ID94 MALALGSIPSSIA 
ID95 MLAjFLFCTLFSLWHP 

ID96 MAQMPLTGSYQDLEYFLECMFLHLLYTLQTISSLSG 
ID97 MALLMGLWVRTVLQG 
DD98 MINHLYLAILDCSLKLTIG 

MGRQGTLEEGILCVITWI^AhJLGKQKDENHYYKKLSLLYLCSF^^ 
MELTNKQTGTDRHEQVLRRVKQDKRISAWWCVLLEVVSOG 
IDlOl MAKRQNPTSVLGLLFSISOTWA 

ID 102 MNVLPFSYYYILFCLSLQIFRVSLA J; 

ID 103 MKCLKVNPFLFLVFNFFSYISXFLSPVCG ? 

MSWTVPVVRASQRVSSVGAhlXLCLGMALCPROA 
IDiOS MGFLXLMTLTTHVHS 
ID 106 MLFRVLLLAQLFLGSG 

ID107 MRVPEDLASKILLPGCAPGSLPLSTSAPPLRG 

IDIOS MFPHXETQVKCFWCyajlRSDLCLCQCILARA 

ID109 MKSLLFTLAVFMXLAQLVSG 

IDI 10 MHLYSCSCMRLLNVACdPFSSS 

roill MRAPLVLSPLSYQCSS 

MQ VPHLRVWTQVXDTnGYRNLGFTSMCILFHCLLS 
MQKl^IAVPMmUQGGDTCmQFLWLMHQSFQiCSNS 

MCXAGFXDHPRAARHARTSRHPLPWVCVSQXPAHRSLCLWPACLC 

IDI 15 MTSKFILVSFILAALSLS /vnAoi-ci-wrALLL 

IDU6 MHLLIFILTVHHTPS 

ID 1 17 MLSSSLMVQUSQVYS 

IDI 18- MFSYILCMLFCLFS 

IDI 19 MLFLYYVTLAFSLLVLSES 

ID 120 MLLSGLWLSSVKEC 

ID 121 MVAFSVFCFSWLMSSSSP 

ID 122 MVPLALGIGPPGCLQG 

ID123 M^^-CMGVUXVGTSRRaCLLWFC^AI^G^^ 

ID124 MSLAKSLFLRVARG 

ID125 MRLPPFLPSATLLLSAES 

ID126 MSDRKRTKFSYVQLPCPISLLPRSFKRGQrPGPSAPPLLLLLREELVTG 

ID127 MTPLGSGPPREASIAQVRGFSRTFFRVAFCFFPAFLVXVXS 

ID128 MRCSALFPLLSLLSC 

©129 MLYDQYYUISLLKLCSFCn 

ID132 MCCLSATLAFSGSFL 

ID 133 MAELDLMAPGPLPRATAQPPAPLSPDSGLRGLLLQEALG 

ID 1 34 Nm-THGNNILHL AhfFFL VACPLFG VCLX 

ID 135 MVLR\^TPWPRGSHS 

ID136 MKARLSGNLICFSFLCTLFHKSNS 



ID112 
ID113 
ID114 
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SEQ.ID 

NO- SIGNAL PFpl-lD E 

©137 MSHVCLVPQTPSLCLG 
ID 138 I^ASFVFKJPSTAYWLTSVNLncING 
ID 139 MSSSRKDHLGAXAQSPSRSSLVWTAPLVSA 
ID 140 MASPAAATYLVQSSACCPA 

roul '^AAIOTGPAPAVTKTEraVQNPDVLWDLDffEARSHADQDSNPXAEAIiJ^ m 

©142 MJNLLVGNCimCAIRASCMWlSFAKFGIFLVIFCSESFS ^ 

ID143 MLCCGPLRFLLRDPGCLLA 

©144 MRKTSFILLRMrVLPTLWT 

©145 MWWKPAPEEGVRVGLVLVXRALC 

©146 MFNFLLGNSSCVYQ 

©147 MKROAF»aM>SQLSASFLQPSLQANCPALDPAVSLSAPAFA 

©148 MKSAKLGFLLRFFIFCSLNTLLLG 

©149 MDILFPLHSVIGSHP 

©150 MLKVFRAXHPKICHFGILILLSQRQWS 

©131 MLVRNARRGSRGRSPWWRAGCLXWRJCLAASWTLS 

©152 MTKGHHHQHPLHPHPLFTLGLGYPIPTOL 

©153 MTYHXIQFSERLHILnvCLARG 

©154 MSQFPU:SPPWKPLVKVSRNLKIRMSn>WPLSVLrYCGLSOPLTLG 

©155 MFRSLTTAFFRDAMGFLLMFDLTSQ 

©156 MVLTn,PU>SANSPVNNIPTrGPNSLSYASSALSPCLX 

©157 MQRNATFIHL(SJURPSLLPTLPWU>STRL 

©158 MNELFCFHSFHPLFQ 

©159 MLTNRNYFNFLFLVQLCILA 

©160 MKLNPGQVPTWWEALCRFVGMQPCTA 

©161 MLAGFRRSAPASQSLCLNLCPCSSSLL 

©162 MKEGASFYLLFFLNDVPP 

©163 MGLECCCPPHNLRVYlErLLLKLSSQSRT 

©164 MQLCPFTSVLSIAASLLQCRL 

©165 MDVrcCFDAVEGSDFRVCCHGCVSWLOQMLQLLFKLNSTWCRA 

©166 MRQGPGAPLHCFCFTLFSYSSS 

©167 MHnLLGIWLTXHLQ 

©168 MLYGSWVCLLSAGTAFE 

©169 MLFFPli,SFRFLPSESUJCXXXXFU.GRRVVG 

©170 MPVWAILGCWGTLSRG 

©171 • MGMSGKKHFPLSWDfflQGSTEATSQGILCGSLPGPSLC 

©172 MASKILLNVQEEVrCPICLELLUPLSLDCGHSLCRA 

©173 MYYMVCLFFRLIFS 

©174 MGAGGXREIRAAAASWLRAAEHSKLAGLWSPOLVPA 

©175 MGSKCCKGGPDEDAVEIMJBRQKLLLAQLHHRKRVKAAGQIQAVVnVRCVL^ 

©176 MQQGHPHLSAGILSIHSWQLLTSAQP 

©177 MSRYEXGSSLLPFPDHFSVYSFKXXSFFEAYSISDYATCCLSLFQWCAV 

©178 MIYFIKINNKLLLLHHYLLLFrTT 

©179 MELLYLKVKRGQKDLSWALCLSQSGYY 

©180 VrrLAVTLSALGATG 

©181 MLGPPLQPGSHGKVLAPQGSSGLTPPFPCRCLITLPRSCRP 

©182 MGNVCSCCLRARYQQLXULVHFPAYS 

©183 MLYGLGSGPRCVKCIHCVWC 

©184 MHRI^fTLIJ^LKA^XNLQ^lKIHVI«MLPGPV^>LDSCPPSAHS 

©185 MLFLVLFYSAIFL 

©186 MVSLCVAALFPLQA 

©187 MSSNLFYPSILTLLLA 

©188 MGLLRKCFPVMLCG^^^H1Q^•C^CQF1LCLGTCRG 
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SEQ.ID 

NO. SIGN'ALPEPTTD^ 

ID 189 MMLPLFCSPWESGG 

ID190 MAKLLSDLSVDSARC 

ID191 MCGYWVCWGHLLPARVST 

ID 192 MKLSCAGCADTAILGLSTTLNLLS 

ID193 MIPFSGTVFSLGSCPAGPLSA 

mm MIPSSQPRFXNPACKQTVLLXDPAVSLSAPAFASA 

ID 195 MAPTTLUSDSFLTS 

ID 196 MXSUVLSLLGDaQWCLS 

roi97 MACDSFLKDALPQELSQLXFLFPLVDMREDLXYFNmPRKVA 

ID200 MARTMGVPRACKAFCSLLSSFCALHFG 

ID201 MILCFLLPHHRLQEA 

ID202 MQDYVSHAVRRHCQCFFVCFSPKIYG 

ID203 MEFAHAAECVSFALNETHVLLNLALSHFNNC 

ID2W MGNQGFPYLSPSLSVQDLLAASWLPRDAPC 

©205 MKYQWIVSGSAQLASPLLPGATP 

ID206 MGPSTPLULFLLSWSGPLQG 

ID207 MASLGHILVFCVCHXTMAKA 

ID208 MSGSSLPSALALSLLLVSGSLLP 

ID211 MGSRKCGGCLSCLUPLALWS 

ID212 MGSRKCGGCLSCLLIPLALWS 

ID213 MMVMILFGVSFVFLTHC 

10214 MSNTHTVLVSLPHHIPALT 

J£SJ^<?fQ^^"T^QVPAFLQELVDRDNSKFEEWCIEM^^ 
K^^S'i^^'^^^^GWMNQQXPhniirrLACVPM^^ lKtKQ^mCRLRS 

ID216 MFPVLGWILIAWIIILLIFT 

ID217 MFSCXaSVaCPCLNKGQS 

ID218 MRLCLINfYCSFGTLSHLTYLLLLSPIKYP 

n)219 MGKCadVAMLILGLLLLALLLPVQVSS 

©220 MGSSGLLSLLVLFVLLANVQG 

ID221 MVIXKKa»VSYLLLCGQAALLLGNLLLLHCVSRSHS 

ID222 - METGRLLSLSSLPLVLLG 

ID223 MAASLGQVLALVLVAALWG 

ID226 MCSLLYPLVTFFLLCLCIAYWAST 

ID227 MLPFLFFSTLFSSIFT 

ID228 MVALNLILVPCCAA 

ID23I MPSVNSAGLCVLQLTTAVrS 

ID232 MMLGLHFALFLLVSXYMIRS 

ID233 MALLLSVLRVLLG 

ID234 MLKSLWLSLVAWHWGEA 

ID235 MGIVTWLLXSFMSSA 

ID236 MAGKALISLSFGGAIGIMFLMLGCALP 

n>237 MKKQKHQFawaSVKLVTLSVPTSLA 

ID238 NlDGIPMSMKNEMPISQLLMnAPSLGFVLFALFVAFLLRG 

^2^' l^oam^ 
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SEQ. ID 

W SIGNAL PFPTmi; 



D24I 



ID252 
ID2S3 



''''' eI^^KSSSS"^^™^^^ 

ID243 MVFPAKRFCLVPSMEGVRWAFSCGTWLPSRA 

ID245 MAAALRVRXXXFGTRA 

©246 MKLLTHNLLSSHVRG 

©247 MGXFSRRTFCGRSGRSCRGQLVQVSRPEVSAGSLLLPAPOA 

ID250 MAAATGDPGLSKLQFAPFSSA 

©2il MFTSTGSSGLYKAPLSKSLLLVPSXLS 

MTSMTQSLREVlKA^fIXAR^^FERVLGK^ 

^^SGLW^'^^^ 

n>254 MGILLGLLLLGHLT 

ID255 MFLTVKLLLGQRCSLKVSG 

ID256 M^^VroHVRDMAAAGU^S^^mLSSLLLTMS^W 

ID257 MGTPSLSILLIGAPESPIPYFPYHSGTGRVLCPLLXAAAAP 

ID258 MVYHALDSPDDDYHALFVLCLLYAMS 

ID259 MnVLSMWLCCGFE 

ID260 MWVILSSXVPLAAM 

ID261 MLAECSSLLHPSVRG 

ID262 MQMARLLGLCAWARK 

ID263 MIPQYU'HGGKYQVLGDYSLAVVFPLHFSDLISVLYLIPKTLT 

ID264 Nma.RAGKKTFLPPLXRAFACRG 

ID265 MKREGGAAHLCSDSLPESQQ 

ID266 MVrCTGPSSGQPLSSMVTAGDRRGAPSLPYSLAACPCGSOG 

©267 MQRQLALEVIVrLSETAA 

©268 MGDYaRGYRMLGETCADCGmLQDKQR^ 

EHQLASASELPLGSRP 

ID269 MWLLYLLVPALFCRA 

©270 • ^«I-EFreK^0OCSFVLQNLNRQRKWCEYWD.MAl^VD^^^V^ 
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Minimum 
signal 

peptide score 


falso positive 
rate 


false 
negative rate 


proba(0.1) 


proba(0.2) 


3.5 


0.121 


0.036 


0.467 


0.664 


4 


0.096 


0.06 


0.519 


0.708 


4.5 


0.078 


0.079 


0.565 


0.745 


5 


0.062 


0.098 


0.615 


0.782 


5.5 


0.05 


0.127 


0.659 


0.813 


6 


0.04 


0.163 


0.694 


0.836 


6.5 


0.033 


0.202 


0.725 


0.855 


7 


0.025 


0.248 


0.763 


0.878 


7.5 


0.021 


0.304 


0.78 


0.889 


8 


0.015 


0.368 


0.816 


0.909 


8.5 


0.012 


0,418 


0.836 


0.92 


9 


0.009 


0.512 


0.856 


0.93 


9.5 


0.007 


0.581 


0.863 


0.934 


10 


0.006 


0.679 


0.835 


0.919 



TABLE IV 
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Minimum 
signal 

peptide 
score 


All ESTs 


New ESTs 


ESTs 

matching 
public EST 
closer than 
40 bp from 

beginning 


ESTs 
extending 
known 
mRNA more 
than 40 bp 


ESTs 
extending 
public EST 
more than 

40 bp 


3.5 


2674 


947 


599 


23 


ISO 


4 


2278 


784 


499 


23 


126 


4.5 


1943 


847 


425 


22 


112 


5 


1657 


523 


353 


21 


96 


5.5 


1417 


419 


307 


19 


80 


6 


1190 


340 


238 


18 


68 


6.5 


1035 


280 


186 


18 


60 


7 


893 


219 


161 


15 


48 


7.5 


753 


173 


132 


12 


36 


8 


636 


133 


101 


11 


29 


8.5 


543 


104 


83 


8 


26 


9 


456 


81 


63 


6 


24 


9.5 


364 


57 


48 


6 


18 


10 


303 


47 


35 


6 


15 



TABLE V 
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ESTs 


ESTs 
flxtendino 
known 
mRNA more 
than 40 bp 




Tissue 


All ESTs 


New ESTs 


matching 
public EST 
closer than 
40 bp from 
beqinnina 


FST« 

WW 1 d 

UAlVllVJItiy 

more than 40 

bo 


Brain 


329 


131 


75 


3 


24 


Cancerous prostate 


134 


40 


37 


1 


6 


Cerebellum 


17 


9 


1 


0 


6 


Colon 


21 


11 


4 


0 


0 


Dystrophic muscle 


41 


18 


8 


0 


1 


retai Drain 


70 


37 


16 


0 


1 


reiai Kidney 


227 


116 


46 


1 


19 


Felal liver 


13 


7 


2 


0 


0 


nean 


30 


15 


7 


0 


1 


nypenropnic prosiaie 


OO 


23 


22 


2 


2 


Kidney 


10 


7 


3 


0 


0 


LarQe Intestine 


21 


8 


4 


0 


1 


Liver 


23 


9 


6 


0 


0 


Lung 


24 


12 


4 


0 


1 


Lung (cells) 


57 


38 


6 


0 


4 


Lymph ganglia 


163 


60 


23 


2 


12 


Lymphocytes 


23 


6 


4 


0 


2 


Mtisde 




1 w 


O 


A 

u 


4 


Normal prostate 


181 


61 


45 


7 


11 


Ovary 


90 


57 


12 


1 


2 


Pancreas 


48 


11 


6 


0 


1 


Placenta 


24 


5 


1 


0 


0 


Prostate 


34 


16 


4 


0 


2 


Spleen 


58 


28 


10 


0 


1 


Substantia nigra 


108 


47 


27 


1 


6 


Surrenals 


15 


3 


3 


1 


0 


Testis 


131 


68 


25 


1 


8 


Thyroid 


17 


8 


2 


0 


2 


Umbilical cord 


55 


17 


12 


1 


3 


Uterus 


28 


15 


3 


0 


2 


Non tissue-spedflc 


S68 


48 


177 


2 


28 


Total 


2877 


947 


601 


23 


150 



TABLE VI 
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Description of Transcription Factor Binding Sites present on promoters 
isolated from SignalTag sequences 

Promoter sequence P13H2 (546 bp): 



Matrix 


Position 


Orlenutlon 


Score 


Length 


Sequence 


CMYB.OI 


•502 


♦ 


0983 


9 


TGTCAOTTG 


MYOoIoe 


-SOI 


• 


0.961 


10 


CCCAACTGAC 


38.01 


-444 


• 


0.9G0 


11 


AATA6AATTAG 


S8.01 


•425 


* 


0.966 


11 


AACTAAATTAG 


0ELTAEF1.01 


•390 




0.960 


11 


6CACACCTCAG 


GATA C 


•364 




0.964 


11 


AQATAAATCCA 


CMYB.01 


•349 


♦ 


0.958 


9 


CTTCAOTTG 




•<#*M 




v.w99 


14 


TTGTAGATAGGACA 


OATA.C 


•339 




0.953 


11 


AGATAOGACAT 


TAL!AlPHAe47^01 


-235 




O.vf J 


io 


CATAACAGATGGTAAG 


TAL1BETAE47.01 


•235 




0.963 


16 


CATAACAGATGGTAAG 


TAL1BETAITF2.01 


•235 




0.978 


16 


CATAACAGATGGTAAG 


MYGO.Oe 


•232 




0.954 


10 


ACCATCTGTT 


GATA1.04 


■217 


- 


0.953 


13 


TCAAGATAAAGTA 


IKI.01 " 


•126 


♦ 


0.963 


13 


AGTTGGGAATTCC 


IK2_01 


-126 


♦ 


0.985 


12 


AGTTGGGAATTC 




•123 




0.962 


10 


TQ60AATTCC 


GATA1 02 


•96 




0.950 


14 


TCAGTGATATGGCA 


SRY 02 


-41 


• 


0.951 


12 


TAAAACAAAACA 


E2F 02 


^3 


♦ 


0.957 


8 


TTTAGCGC 


MZF1 01 


-5 


• 


a97S 


8 


TGAGGOGA 


Promoter sequence P1584 <861bp) : 










Matrix 


Position 


Orientation 


Score 


Length 


Sequence 


NFY^QS 


•746 




0.956 


11 


GGACCAATCAT 


MZF1.01 


-738 




0.962 


8 


CCTGGfiGA 

WW 1 UUUUM 


CMYB.01 


•664 


♦ 


0.994 


9 


TGACCGTTG 


VMY8"02 


•682 




0.985 


9 


TCCAACGGT 


STAT 1)1 


^3 


♦ 


0.968 


9 


TTCCTGGAA 


STAT.01 


•673 


• 


0.951 


9 


TTCCAGGAA 


mzfCoi 


•556 


• 


0.956 


8 


TTGGGGGA 


IK2_01 


-451 


♦ 


0.965 


12 


GAATGGGATTTC 


MZF1 01 


-424 




0.986 


8 


AGAGGGGA 


SRY_02 


•396 


• 


0.955 


12 


GAAAACAAAACA 


wzfT.oi 


•216 




0.960 


8 


GAAGGGGA 


MYOO Q6 


•190 


♦ 


0.981 


10 


AGCATCTGCC 


o£lta"efi^oi 


•176 




0.958 


11 


TCCCACCTTCC 


S8 01 


5 


• 


0.992 


11 


QAGGCAATTAT 


MZF1_01 


16 


• 


0.966 


8 


AGAGGGGA 


Promoter sequence P29B6 (656 bp) : 










Matrix 


PosHlon 


Orientation 


Score 


Lengtli 


Sequence 


ARNT.01 


•311 


♦ 


0964 


18 


GGACTCACGTGCTGCT 


NMYcloi 


•309 


♦ 


0965 


12 


ACTCACGTGCTG 


USF.01 


•309 


♦ 


0985 


12 


ACTCACGTGCTG 


USF_01 


•309 




0.965 


12 


CAGCACGTGAGT 


NMYC.Ol 


-309 




0.956 


12 


CAGCACGTGAGT 


MYCMAX.02 


-309 




0.972 


12 


CAGCACGTGAGT 


USF.C 


.307 


♦ 


0.997 


8 


TCACGTGC 


USF C 


•307 




0.991 


8 


GCACGTGA 


MZFT 01 


•292 




0.968 


8 


CATOGGGA 


ELK1.02 


•105 




0.963 


14 


CTCTCCGGAAGCCT 


CETS1P54,01 


•102 


♦ 


0.974 


10 


TCCGGAAGCC 


API Q4 


•42 




0.963 


11 


AGTGACTGAAC 


AP1FJ.Q2 


•42 




0.961 


11 


AGTGACTGAAC 


PAOS.C 


46 


♦ 


1.000 


9 


TGTGGTCTC 






TABLE VII 
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CLAIMS 

1 . A purified or isolated nucleic acid comprising the sequence of one of SEQ ID 
NOs: 38-270 or comprising a sequence complementary thereto. 

2. The nucleic add of Claim 1 , wherein said nucleic acid is recombinant. 

3. A purified or isolated nucleic acid comprising at least 10 consecutive bases of 
the sequence of one of SEQ ID NOs: 38-270 or one of the sequences complementary 
thereto. 

4. A purified or isolated nucleic acid comprising at least 15 consecutive bases of 
one of the sequences of SEQ ID NOs: 38-270 or one of the sequences complememaiy 
thereto. 

5 . The nucleic acid of Claim 4, wherein said nucleic add is recomUnant. 

6. A purified or isolated nuddc add of at least IS bases capable of hybricfizing 
under stringent conditions to the sequence of one of SEQ ID NOs: 38-270 or one of the 

15 sequences complementary to the sequences of SEQ ID NOs: 38-270. 

7. The nuddc add of Claim 6, wherein said nucldc add is recombinant. 

8. A purified or isolated nuddc add encoding a human gene product, said 
human gene product having a sequence partially encoded by one of tfic sequences of SEQ ED 
NO: 38-270. 

20 9. A purified or isolated nucldc add having the sequence of one of SEQ ID 

NOs: 38-270 or having a sequence complementary thereto. 

10. A purified or isolated nucleic acid comprising the nucleotides of one of SEQ 
ID NOs: 38-270 which encode a signal peptide. 

11. A purified or isolated polypeptides comprising a signal peptide encoded by 
25 one of the sequences of SEQ ID NOs: 38-270. 

12. A vector encoding a fusion protein comprising a polypeptide and a signal 
peptide, said vector comprising a first nucleic add encoding a agnal peptide encoded by one 
of the sequences of SEQ ID NOs: 38-270 operably linked to a second nucleic acid encoding a 
polypeptide. 

^0 13. A method of direaing the extracellular secretion of a polypeptide or the 

insertion of a poiypedde into the membrane comprising the steps of 



0 
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obtaining a vector acccwding to Claim 12; and 

introdudi^ said vector into a host cell such that said fusion protein is secreted into 
the extraceUular environment of said host ceO or inserted into the membrane of said host cell. 

14. A method of impoiting a polypeptide into a cell comprising contacting said 
5 cell with a fusion protem comprising a signal peptide encoded by one of the sequences of 

SEQ ID NOs: 38-270 operabiy linked to said polypeptide. 

15. A method of making a cDNA encoding a human secretory protein that is 
partially encoded by one of SEQ ID NOs 38-270. comprising the steps of 

obtaining a cDNA comprising one of the sequences of SEQ ID NOs: 38-270; 
10 contacting said cDNA with a detectable probe comprising at least 15 consecutive 

nucleotkies of said sequence of SEQ ID NO: 38-270 or a sequence complementary thereto 
under conditions w^ch permit said probe to hybridize to said cDNA; 

identifying a cDNA which hybridizes to said dcteaabic probe; and 

isolating said cDNA which hybridizes to said probe. 

16. An isolated or purified cDNA encoding a human seaetoiy protein, said 
human secretory protein comprising the protein encoded by one of SEQ ID NOs 38-270 or a 
fragment thereof of at least 10 amino adds, said cDNA being obtainable by the method of 
Claim 15. 

17. The cDNA of Claim 16 wherein said cDNA comprises die fiill protein coding 
20 sequence paitiaOy induded in one of the sequences of SEQ ID NOs: 38-270. 

18. A method of making a cDNA con^risiis one of the sequences of SEQ ID 
NOs: 38-270, comprising the steps of 

contacting a coOecdon of mRNA molecules from human cells with a first primer 
capable of Iq^dizing to the pdyA tail of said mRNA; 
25 hybridizing saki first primer to said polyA tail; 

reverse transcribing said mRNA to make a first cDNA strand; 

making a second cDNA strand complementary to said first cDNA strand using it 
least one primer comprising at least 15 nucleotides of one of the sequences of SEQ ID NOs 
38-270; and 

30 isolating the resulting cDNA comprising said first cDNA strand and said second 

cDNA strand. 
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19. • An isolated or purified cDNA encoding a human secretory proton, said 
human secretory protein comprising the protein encoded by one of SEQ ID NOs 38-270 or a 
fragment thereof of at least 10 amino adds, said cDNA being obtainable by the method of 
Claim 18. 

5 20. The cDNA of Claim 19 wherein said cDNA comprises the fiill protein coding 

sequence partiaUy included in one of the sequences of SEQ ID NOs: 38-270, 

21. The medwd of Claim 18, wherein the second cDNA strand is made by: 

contacting said first cDNA strand with a first pair of primers, said first pair of primers 
comprising a second primer comprising at least 15 consecutive nucleotides of one of the 
10 sequences of SEQ ID NOs 38-270 and a third primer having a sequence therein which is 
included within the sequence of said first primer. 

performing a first polymerase chain reaction with said first pair of nested primers to 
generate a first PCR produa; 

contacting said first PCR product with a second pair of primers, said second pair of 
15 primers comprising a fourth primer, said fourth primer comprising at least 15 consecutive 
nucleotides of said sequence of one of SEQ ID NO:s 38-270 , and a fifth primer, said founh 
and fifth primers being capable of hybridizing to sequences within said first PCR produa; and 

performing a second polymerase chain reaction, thereby generating a second PCR 
product. 

20 22. An isolated or purified cDNA encoding a human secretory protein, said 

human secretory protein comprising die protein encoded by one of SEQ ID NOs 38-270, or a 
fragment thereof of at least 10 amino adds, said cDNA being obtainable by the method of 
Claim 21. 

23. The cDNA of Claim 22 wherein said cDNA comprises the fiiU protein coding 
25 sequence partially included in one of the sequences of SEQ ID NOs: 38-270. 

24. The method of Claim 1 8 wherein the second cDNA strand is made by: 
contaaing said first cDNA strand with a second primer comprising at least 15 

consecutive nucleotides of the sequences of SEQ ID NOs: 38-270; 
hybridizing said second primer to said first strand cDN A; and 
30 extending said hybridized second primer to generate said second cDNA strand. 
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25. * An isolated or purified cDNA encoding a human secretory protein, said 
human secretory protein comprising the protein partially encoded by one of SEQ ED NOs 38- 
270 or comprising a fragment thereof of at least 10 amino adds, said cDNA being obtainable 
by the method of Claim 24. 
5 26. The cDNA of Claim 25, wherein said cDNA conq)rises the full protein coding 

sequence partially included in of one of the sequences of SEQ ID NOs: 38-270. 

27. A method of making a protein comprising one of die sequences of SEQ ID 
NO: 271-503, comprising the steps of: 

obtaining a cDNA encoding the full protein sequence partially inchided in one of the 
10 sequences of sequence of SEQ ID NO: 38-270; 

inserting said cDNA in an oqjression vector sudi that said cDNA is operably linked 
to a promoter; 

introducing said cxpresaon vector into a host cell whereby said host cell produces the 
protein encoded by said cDNA; and 
I S isolating said proteia 

28. An isolated protein obtainable by the method of Claim 27. 

29. A method of obtaining a promoter DNA comprising the steps of: 
obtaining DNAs located upstream of the nucleic adds of SEQ ID NO: 38-270 or the 

sequences complementary thereto; 
20 . screening said upstream DNAs to identify a promoter cs^^able of (Greeting 

transcription initiation; and 

isolating said DNA comprising said identified promoter. 

30. The method of Claim 29, wherdn said obtaining step comprises dvomosome 
walking from said nuddc adds of SEQ ID NO: 38-270 or sequences complementaiy thereto. 

25 31. The mediod of Claim 30, wherein said screening step comprises inserting said 

upstream sequences into a promoter reporter vector 

32. The method of Claim 30, wherein said screening step comprises identifying 
motife in said upstream DNAs which are transcription factor binding sites or transcription 
start sites. 

30 33 . An isolated promoter obtainable by the method of Claim 32. 
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34. • An isolated or purified protein comprising one of the sequences of SEQ ID 
NO: 271-503. 

35. In an array of discrete ESTs or fi:agments thereof of at least 1 5 nucleotides in 
length, the in5)roveinent comprising inclusion in said array of at least one of the sequences of 

5 SEQ ID NOs: 38-270, or one of the sequences complementary to the sequences of SEQ ID 
NOs: 38-270, or a fiagment thereof of at least 15 consecutive nucleotides. 

36. The array of Claim 35 inchiding therein at least two of the sequences of SEQ 
ID NOs: 38-270, the sequences complementary to the sequences of SEQ ID NOs: 38-270, or 
fragments thereof of at least 1 5 consecutive nucleotides. 

0 37. The array of Claim 35 including therdn at least five of the sequences of SEQ 

ID NOs: 38-270, the sequences complementary to the sequences of SEQ ID NOs: 38-270, or 
fragments thereof of at least 15 consecutive nucleotides. 
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SEQUENCE LISTING 

(I) GENERAL INFORMATION: 

(i) APPLICANT: • . 

(A) NAME : GENSET 3A 

:3) STREET :24, RUE ROYALS 

•:C) CITY: PARIS 

(I) COUNTRY : FRANCE 

{F) POSTAL CODE (21?) : 75009 



(ii) TITLE or INVENTION: 5' ESTs FOR SECRETED PROTEINS 
EXPRESSED IN TESTIS AND OTHER TISSUES 

(iii) NUMBER OF SEQUENCES: 503 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy Disk 
(3) COMPUTER: IBM Pc' compatible 

(C) OPERATING SYSTEM: Win95 

(D) SOFTWARE: Word 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDMESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 

(ix) FEATURE: 

(A) NAME/KEY: Cap 

(B) LOCATION: 1 

(D) OTHER INFORMATION: m7Gppp added to 1 
(xi) SEQUENCE DSSCRIPTIOM: SEQ ID NO: 1: 



CCCAUCCUAC UCCCAUCCAA UUCCACCCUA ACUCCUCCCA UCUCCAC 



m 



47 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRA:4DEDNESS : SINGLE 

ID) TOPOLOGY: LINEAR 

(ii) XOLECULE TYPE: 0-:h^r -.ucieic acid 
(xi) SEQUENCE DESCRIPTIOM: SEQ ID NO: 2: 



-.yCCUAC-J CZCAUCCAAU UCCkCS.OJiz, CUCCLCCCAU CUCCAC 4 6 
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•2) INFORMATION FOR SEQ ID SO: 3: 

(il SEQUENCE .CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



ATCAAGAATT CGCACGAGAC CATTA 



(2) INFORMATION FOR SEQ ID NO: 4: 

(1) SEQUtNCr: CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) STRANDEDNESS: SINGLE 
(DJ TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 



TAATGGTCTC GTGCGAATTC TTGAT 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(3) TYPE: NUCLEIC ACID 

iC) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 



CCGACAAGAC CAACGTCAAG GCCGC 



(2) INFORM=iTIOM FOR SEQ ID NO: 6: 

(i; SEO'jENCE CHARACTERISTICS: 

';••.) LENGTH: 25 base pairs 
,Z) TV?£: NUCLEIC ACID 
C) STPANDE0NE3S : SINGLE 
'/ TOPOLOGY: LINEAR 
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(ii) .MOLECULE TYPE: Ct^er nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 



TCACCAGCAG GCAGTGGCTT AGGAG 



(2} INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



AGTGATTCCT GCTACTTTGG ATGGC 



{2) INFORMATION FOR SEQ ID NO: 8: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 
(xi; SEQUENCE DESCRIPTION: SEQ ID NO: 8 



GCTTGGTCTT 3TTCTGGAGT TTAGA 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii> XOLECULE TYPE: Other nucleic acid 
(xi: SEQUENCE DESCRIPTION: SEQ ID NO: 9 



tccagaatg:.^ xagacaagcc aattt 



:2) INF0?,;:;.T:0M for SSQ id NC: IC: 
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;i) .IZ'^LZ^iCZ CHAK.-^.CTE;?J3TICS: 

(A) LENGTH: 25 base pairs 
(Bi TYPE: NUCLEIC ACID 
(C? STRANDEDNESS: SINGLE 
(D) TOPOLOGY: LINEAR 

t'ii) >;OLECULE TYPE: Other nucleic acid 

iKi) SZQ'CZnCZ DESCRIPTION: SEQ ID NO: 10; 

AGGGAG3AGG .A.A.ACAGCGTG AGTCC 25 



(2) INrCRMATION FOR SEQ ID NO: 11: 

(L) SEQUEN'CE CHARACTERISTICS: 

(A) LENGTH: 2S base pairs 
(S) TYPE: NUCLEIC ACID 
{O STRANDEDNESS: SINGLE 
(D) TOPOLOGY: LINEAR 

(ill WOLECULE TYPE: Other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATGGGAAAGG AAAAGACTCA TATCA 25 



(2) INF0R:-1ATI0N for SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

■;A) LENGTH: 25 base pairs 
{3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: Other nucleic acid 

SEQUENCE DSSCRIPTIOiS : SEQ ID NO: 12: 

AGCAGCAACA ATCAGGACAG CACAG 25 



(2; INFORMATION FOR SEQ ID NO: 13: 

ii) SZQ'CZUCZ CHARACTERISTICS: 

:A1 LENGTH: 25 base pairs 

TYPE: NUCLEIC ACID 
:Z) STRANDEDNESS: SINGLE 

TOPOLOGY: LINEAR 

::'.;L^C*jLE type: Other nucleic acid 
5E:;ENCE DESCRIPTION: SEQ ID NO: 13: 
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ATCAAGAATT CGCACGAGAC CATTA 25 

(2) IMFORMATIOM FOR 3EQ ID NO: 14: 

/I) SIQUEMCE CHARACTERISTICS: 

(A; LENGTH: C7 base paira 
(B: TYPE: NUCLEIC ACID 
iC) 3TRAN0SDMESS: SINGLE 
iO) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATCGTTGAGA CTCGTACCAG CAGAGTCACG AGAGAGACTA CACGGTACTG gxtt-ttt^xT 60 
TTTTTVN 6'' 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQLJENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 
(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCAGCAGAGT CACGAGAGAG ACTACACGG 29 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i! SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 
(0) TOPOLOGY: LINEAR 

{ID ;<OLECULE TYPE: Other nucleic acid 

{:■::.) SEQUENCE DESCRIPTION: SEQ ID KO: 16: 

CACGAGAGAG AC7ACACGGT ACTGG 25 

(2: :^:^.^^iAT:o^; for seq id no: i^: 
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s?:oc;e:mce characteristics: 

(At LENGTH: 526 base pairs 

(3i type:: nucleic acid 

(C) STRANDEDNESS: LOUBLE 
O; TOPOLOGY: LINEAR 

MLECULE TYPE: CDMA 

(vi: ORIGINAL SOURCE: 

(A) ORGAMISM: Homo Sapiens 

(D TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

:A) NAME/KEY: other 

O) LOCATION: coraplement ( 261 . . 37 6 ) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 95 

region 166. .281 
id N70479 
est 

iiK) FEATURE: 

(A) NAME/KEY: Other 

{3) LOCATION: complement ( 380 .. 486) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 54 . .160 
id N70479 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: complement ( 110 1 4 5 } 

(C) IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identity 94 

region 403. . 438 
id N70479 
est 

(i:-: FEATURE: 

(A) NAME/KEY: Other 

(3) LOCATION: complement ( 135 229) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: ident^-ry 94 

region 315 . . 348 
id N70479 
est 

(ix; FEATURE: 

(A) NAME/KEY: sig peptide 
tB) LOCATION: 90.Tl4 0 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.2 

seq LLLITAILAV.AVG/FF 

SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

.-.GCTACAAIA TTCCAGGGCC ARTCACTTCC CATTTCTCAT .^^ACAGCGTCA 50 

i;-. :-.:tgactg.-.r acgtttg.ag atg .^^j^.g a.^a gtt ctc ctc tt:- .atc 1:3 
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Mei Lys Lys Val Leu Leu Leu lie 
-13 -10 

ACA GCC ATC TTG GCA GTG GC? GTW GGT TTC CCA GTC TC7 CAA GAC GAG -cl 
Thr Ala 11^ Leu Ala Val Ala Val Gly Phe Pro Vai Ser Gin Asp Gin 
-5" 1 5 

GA.^ CGA GA-^ AAA AGA AGT ATC AGT GAC AGC GAT G.^.^ TTA GC? TCA GGR 205 
Glu Arg Glu Lys Arg Ser lie Ser Asp Ser Asp Glu Leu Ala Ser Gly 
10 15 2C 

WTT TTT GTG TTC CCT TAC CCA TAT CCA TTT CGC CCA CTT CCA CCA ATT 257 
Xaa pne Val Phe Pro Tyr Pro Tyr Pro ?he Arg Pro Leu Pro i'ro lie 
25 30 35 

CCA TTT CCA AGA TTT CCA TGG TTT AGA CGT AAN TTT CCT ATT CCA ATA 305 
Pro Phe Pro Arg Phe Pro Trp Phe Arg Arg Xaa Phe Pro He Fro He 
40 45 30 55 

CCT GAA TCT GCC CCT ACA ACT CCC CTT CCT AGC GAA AAG TAAAC^y^RAA 354 
Pro Glu Ser Ala Pro Thr Thr Pro Leu Pro Ser Glu Lys 
60 65 

GGAAAA.GTCA CRATAAACCT GGTCACCTGA AATTGA.AATT GAGCCACTTC CTTGAARAAT 414 

CAAAATTCCT GTTAATAAAA RAAAAACAAA TGTAATTGAA ATAGCACACA GCATTCTCTA 4 74 

GTCAATATCT TTAGTGATCT TCTT7AATAA ACATGAAAGC AAAAAAAAAA AA 52 G 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

Hi) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Honio Sapiens 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: i..T7 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: score 3.2 

seq LLLITAILAVAVG/F? 

ixi) SEQUENCE DESCRIPTION: SEQ 10 NO: 13: 



Mec 1/s Lys Val Leu Leu Leu He Thr Ala He Leu Ala Val Ala Val 
- 5 10 15 
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5IQUENCE CHARACTERISTICS: 
(A) LENGTH: 822 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRAMDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 



(i-. ! >!OLECULE TYPE: CDNA 



(VI. JKI3INAL SGUrCCt.: 

•:A) ORGANISM: Honto Sapiens 

{D) DEVELOPMENTAL STAGE: Fe-ai 

(F) TISSUE TYPE: kidney 

(ix: FEATURE : 

(A) NAME/KEY: czt\er 

(B) LOCATION: 260. ,4 64 

(C) IDENTIFICATION METHOD: bias^n 

(D) OTHER INFORMATION: identity 96 

region 153 . . 357 
id H57434 



iL:<) FEATURE: 

(Al NAME/KEY: other 
(3) LOCATION: 1X8.. 184 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 98. .164 
id H57434 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 56.. 113 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 99 

region 35, . 92 
id H57434 
est 



(ir.x) FEATURE: 

(A) NAME/BCEY: other 

(B) LOCATION: 454.. 485 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 343. .379 
id H574 34 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 118.. 545 

(C) IDENTIFICATION METHOD: blastn 

(0) OTHER INFORMATION: identity 98 

region i . . 428 
id N27243 
est 



FEATURE : 
^A) tL^MZ/KZY: other 
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" (B) LOCATION: 65 . . 369 

(C) IDEvN'TinCATION METKOC: blasrn 
iO) OTHER INrORM-ATION: identity 98 

region 4 1. .345 
id HS4779 
est 

lix; FiATJRE: 

(A; NAI-I£/K£r: other 

(3} LOCATION: 61.. 399 

(C: IDENTiriCATIOM METHOD: biastn 

(0) OTHER INFORMATION: identity 99 

region 6. .34 4 

id H09380 

est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(5) LOCATION: 408.. 453 

(C) IDENTIFICATION METHOD: biastn 

(0) OTHER INFORMATION; identity 92 

region 355. . 405 

id K0988O 

est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 60.. 399 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORM.ATION: identity 97 

region 56. .395 
id H29351 
est 



(ix: FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 393.. 4 32 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 90 

region 391.. 430 
id H29351 
est 



{ix} FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 34 6.. 408 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.5 

seq SFLFSALVIWTS.VAF 

{xi: SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



.ACTCCTTTTA GCATAGG3GC TTCGGCGCCA GCGGCCAGCG CTAGTCGGTC TGGTAAGTGC 60 

CTGATGCCG.-. GTTCCGTCTC TCGCGTCTTT TCCTGGTCCC AGGCAAAGCG GASGNAGATC 120 

CTCPAACGr^r rtTAGTrrrrr hc^cttccgg agaaaatcag cggtctaatt aattcctctg isc 

3T7-;GTTG.-.-. GCAGTTACCA AG.^.ATCTTCA ACCCTTTCCC ACAAAAGCTA ATTGAGTACA 2 AC 
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CGTTCCTGTT GAGTACACGT TCCTGTTGA7 TTACatAAAGG TGCAGGTATG AGCAGGTCTG 3 30 

AAGACT.^iACA TTTTGTGAAG TTGTAAAACA GAAAACCT3T TAGAA ATG TGG TGG 7TT 35:? 

Met Tro Trp Phe 
-20 

CAG CPA GGC CTC AGT TTC CTT CCT TCA GCC CTT GTA ATT TGG ACA TCT 4C5 
Gin Gin Gly Leu Ser Phe Leu Pro Ser Ala leu Val lie Trp Thr Ser 
-15 -10 -5 

GCT GCT TTC ATA TTT TCA TAC ATT ACT GCA GTA ACA CTC CAC CAT ATA 45 i 

Ala Aia Phe lie Phe Ser Tyr lie Thr Ala Val Thr Leu His His He 
15 10 IS 

GAC CCG GCT TTA CCT TAT ATC AGT GAC ACT GGT ACA GTA GCT CCA RAA 501 
Asp Pro Aia Leu Pro Tyr lie Ser Asp Thr Giy Thr Vai Ala Pro Kaa 
20 25 30 

A?JK TGC TTA TTT GGG GCA ATG CTA AAT ATT GCG GCA GTT TTA TGT CAA 54 9 

Lys Cys Leu Phs Gly Aia Met Leu Asn lie Aia Ala Val Leu Cys Gin 
35 40 45 

AAA TAGAAATCAG GAARATAATT CAACTTAAAG AAKTTCATTT CATGACCAAA 502 
Lys 

CTCTTCARAA ACATGTCTTT ACAAGCATAT CTCTTGTA7T GCTTTCTACA CTGTTGAATT 662 

GTCTGGCAAT ATTTCTGCAG TGGAAAATTT GATTTARMTA GTTCTTGACT GATAAATATG 722 

GTAAGGTGGG CTTTTCCCCC TGTGTAATTG' GCTACTATGT CTTACTGAGC CAAGTTGTAW 732 

TTTGAAATAA AATGATATGA GAGTGACACA AAAAAAAAAA 822 



(2) INFOEIMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(ix) FEATURE: 

(A) NAME/KEY: sig__pepticle 
(3) LOCATION: 1. .21 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 5.5 

seq SFLPSALVIWTSA/AF 

{y.i) SEQUENCE DESCRIPTION: SEQ ID KO: 20: 



-.:r. Trp Trp Phe Gin Gin Gly Leu Ser t^r.e Leu Pro Ser Aia Leu Val 
1 5 10 15 
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lie Tro Thr. Ser Ala 
20 



(2) INrORI-L^TICN FOR SHTQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 405 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE-: CDNA 

{vi) ORIGINAL SOURCE: 

■A) ORGANISM: Homo Sapiens 
■?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: complenent { 103 . . 398 ) 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 96 

region 1. .296 
id AA442893 
est 

(ix) FEATURE: 

{A) NAME/KEY: sig^peptide 
(3) LOCATION: 185.. 295 

(CI IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 5.9 

seq LSYASSALSPCLT/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



ATCACCTTCT TCTCCATCCT TSTCTGGGCC AGTCCCCARC CCAGTCCCTC TCCTGACCTG 60 

CCCAGCCCAA GTCAGCCTTC AGCACGCGCT TTTCTGCACA CAGATATTCC AGGCCTACCT 120 

GGCATTCCAG G.\CCTCCGMA ATGATGCTCC AG7CCCTTAC AAGCGCTTCC TGGATGAGGG 180 

TGGC ATG G7G CTG ACC ACC CTC CCC TTG CCC TCT GCC AAC AGC CCT GTG 229 
Met Val Lea Thr Thr Leu Pro Leu Pro Scr Ala Asn Ser Pro Val 
-35 -30 -25 

AAC ATG CCC ACC ACT GGC CCC AAC AGC CTG AGT TAT GCT AGC TCT GCC 27 7 

Asa Mec Pro Thr Thr Gly Pro Asn Ser Leu Ser Tvr Aia Ser Ser Aia 
-20 -15 ^ -10 

CTG TCC CCC TGT CTG ACC GCT CCA AAK TCC CCC CGG CTT GCT MQ ATG 325 
Leu Ser Pro Cys Leu Thr Ala Pro Xaa Ser Pro .Arg Leu Ala Met Met 

I 5 10 



CCT GA: A-.C TAAATATCCT TATCCAAATC AATAAARWRA RAATCCTCCC TCCARA.AGGG 334 
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(2) INFORMATION FOR SZQ ID NO: 22: 

(i) SEQUENCE -CHARACTERISr ICS: 

(A) LENGTH: 37 amino acids 
(3) TYPE: ?mm ACID 
:D) TOPOLOGY: LINEAR 

. (ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: 1 . . 37 

iC) IDENTIFICATION METHOD: Von Heijne matrix 
{0} OTHER INFORMATION: score 5.9 

seq LSYAS3ALSPCLT/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Met Val Leu Thr Thr Leu Pro Leu Fro Ser Ala Asn Ser Pro Val Asn 
15 10 15 

Met Pro Thr Thr Giy Pro Asn Ser Leu Ser Tyr Aia Ser Ser Ala Leu 
20 25 30 

Ser Pro Cys Leu Thr 
35 



12) INFORf!ATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 96 base pairs 

(3) TYPE; NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Cancerous prostate 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 149,. 331 

(C) IDENTIFICATION METHOD: fciastn 
(0) OTHER INFORMATION: identity 98 

region 1 . . 133 
id AA397994 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
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iB) LOCATION: 328. .465 

(C) IDENTIFICATION METHOD: blastn 

{D) OTHER INFORMATION: identitiy 56 

region 179. . 336 
id AA397994 
est 

:lx} F£ATUR£: 

(A) NAME/KEY: other 

(B) LOCATION: compiement ( 182 . . 496) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 14 . . 328 
id AA399680 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig^peotide 
(3) LOCATION: 196.. 240 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.5 

sec ILSTVTALTFAXA/LD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



AAA.A.2ATTGG TCCCAGTTTT CACCCTGCCG CAGGGCTGGC TGGGGAGGGC .'^GCGGTTTAG 60 

ATTAGCCGTG GCCTAGGCCG TTTAACGGGG TGACACGAGC NTGCAGGGCC Q^GTCCAAGG 120 

CCCGGAGATA GGACCAACCG TCAGGAATGC GAGGAATGTT TTTCTTCGGA CTCTATCGAG 180 

GCAC.ACAGAC AGACC ATG GGG ATT CTG TCT ACA GTG ACA GCC TTA ACA TTT 231 
.Met Gly lie Leu Ser Thr Val Thr Ma Leu Thr Phe 
-15 -10 -5 

GCC ARA GCC CTG GAC GGC TGC AGA AAT GGC ATT GCC CAC CCT GCA AGT 279 
Ala Xaa Aia Leu Asp Gly Cys Arg Asn Gly He Ala His Pro Ala Ser 
1 5 10 

GAG AAG CAC AGA CTC GAG AAA TGT AGG GAA CTC GAG ASC ASC CAC TCG 327 
Glu L/s His Arg Leu Glu Lys Cys Arg Giu Leu Glu Xaa Xaa His Ser 
IS 20 25 

GCC CCA GGA TCA ACC CAS CAC CGA AGA AAA ACA ACC AGA AGA AAT TAT 375 
Ala Pro Gly Ser Thr Xaa Hia Arg Arg Lys Thr Thr Arg Arg Asn Tyr 
30 35 40 45 

TCT TCA GCC TGAAATGAAK CCCCCATCAA ATGGTTGCTG ATCARAGCCC AT.ATTTAA.A7 4 34 
Ser Ser Ala 



TGGAAAAGTC AAATTGA3CA TTATTAAATA AAGCTTGTTT AATATGTCTC AA.ACAAAAAA 4 94 

4 05 



INFORMATION FOR SEQ ID NO: 24: 
orQ:TrNr:E CHARACTERISTICS: 
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..(A) LE>JGTK: 15 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

;ii; molecule: type: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptice 
(3) LOCATION: 1 . . 15 

(C) IDENTinCATICN METHOD: Von Heijne matrix 

(D) OTHER INFORMATION; score 5.5 

seq ILSTVTALTrAXA/LD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Met Gly lie Leu Ser Thr Val Thr Ala Leu Thr Phe Ala Xaa Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 base pairs 

(B) TYPE; NUCLEIC ACID 
iC) STRANDEDNESS: DOUBLE 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 4 9.. 96* 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 10.1 

seq LVLTLCTLPLAVA/SA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



AAAGATCCCT GCAGCCCGGC AGGAGAGAAG GCTGAGCCTT CTGGCGTC ATG GAG AGG 57 

Met Glu Arg 
-15 

CTC GTC CTA .ACC CTG TGC ACC CTC CCG CTG GCT GTG GCG TCT GCT GGC 105 
Leu Val Leu Thr Leu Cys Thr Leu Pro Leu Ala Val Ala Ser Ala Gly 
-10 -5 1 

TGC GCC ACG ACG CCA GCT CGC A.AC CTG AGC TGC TAC CAG TGC TTC AAG 153 
Cys Ala Thr Thr Pro Ala Arg Asn Leu Ser Cys Tvr Gin Cys Phe Lys 
5 10 15 

GTC .AGC AGC TGG ACG G^^G TC^C r.C.a CCC ACC TGG TGC AGC CCG CTG G.AC 201 
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b 

Val Ser Ser Trp Thr Glu Cys Pro Pro Thr Trp Cys Ser Pro Lsu Asp 
20 25 30 35 

CAA GTC TGC A?C TCC AAC GAG GTG GTC GTC TCT TTT AA.^^ TGG AGT GTA 2 0 

Gir. Val Cys lie 6er Asn Giu Val Val Val Ser Phe Lys Trp Ser Val 
40 45 ' 50 

CGC GTC CTG CTC AGC AAA CGC TGT GCT CCC AGA TGT CCC AAC GAC AAC 2 97 

Arg Vai Leu Leu Ser Lys Arg Cys Aia Pro Arg Cys Pro Asn Aso Asn 
55 60 65 

ATG AAK TTC GAA TGG TCG CCG GCC CCC ATG GTG C.^ GGC GTG ATC ACC 34 5 

Met Xaa Phe Giu Trp Ser Pro Ala Pro Met Val Gin Gly Val lie Tnr 
70 75 80 

AGG CGC TGC TGT TCC TGG GCT CTC TGC AAC AGG GCA CTG ACC CCA CAG 393 
Arg Arg Cys Cys Ser Trp Aia Leu Cys Asn Arg Aia Leu Thr Pro Gin 
85 90 95 

GAG GGG CGC TGG GCC CTG CRA GGG GGG CTC CTG CTC CAG GAC CCT TCG 441 
Giu Giy Arg Trp Ala Leu Xaa Gly Glv Leu Leu Leu Gin Asp Pro Ser 

lUO 105 110 115 

AGG GGC ARA AAA ACC TGG GTG CGG CCA CAG CTG GGG CTC CCA CTC TGC 489 
Arg Gly Xaa Lys Thr Trp Val Arg Pro Gin Leu Gly Leu Pro Leu Cys 
120 125 130 

CTT CCC AWT TCC AAC CCC CTC TGC CCA RGG GAA ACC CAG GAA GGA 534 
Leu Pro Xaa Ser Asn Pro Leu Cys Pro Xaa Glu Thr Gin Glu Gly 
135 140 145 

TAACACTGTG GGTGCCCCCA CCTGTGCATT GGGACCACRA CTTCACCCTC TTGGARACAA 594 

TAAACTCTCA TGCCCCCAAA AAAAAAAAA 623 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
(S) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapiens 

(ix) FEATURE: 

(A) NAME/KEY: sig_oeptide 
(3) LOCATION: 1. . 16 

\Z) IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 10.1 

seq LVLTLCTLPLAVA/SA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 25: 



Glu Arg leu Vai Leu Thr Leu Cys Thr Leu Pro Leu Ala Vai Aia 
5 ' 10 13 
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(2) INFORMATION FOR SSQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

:A) LENGTH: 848 base pairs 

<:B) TYPE: NUCLEIC ACID 

<C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(D) DEVELOPMENTAL STAGS: Fetal 

(F) TISSUE TYPE: kidney 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 32.. 73 

(C) IDENTIFICATION METHOD: Von Heijne matri.x 

(D) OTHER INFORMATION: score 10. 7 

seq LWLLFFLVTAIHA/EL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



AACTTTGCCT TGTGT7TTCC ACCCTGAAAG A ATG TTG TGG CTG CTC TTT TTT CTG 55 

Met Leu Trp Leu Leu Phe Phe Leu 
-10 

GTG ACT GCC ATT CAT GCT GAA CTC TGT CAA CCA GGT GCA GAA AAT GCT 103 
Val Thr Ala He His Ala Glu Leu Cys Gin Pro Gly Ala Glu Asn Ala 
-5 15 10 

TTT AAA GTG AG A CTT AGT ATC AGA ACA GCT CTG GGA GAT AAA GCA T.AT .151 
Phe Lys Val Arg Leu Ser He Arg Thr Ala Leu Gly Asp Lys Ala Tyr 
15 20 25 

GCC TGG GAT ACC .AAT GAA GAA TAC CTC TTC AAA GCG ATG GTA GCT TTC 199 
Ala Trp Asp Thr Asn Glu Glu Tyr Leu Phe Lys Ala Met Val Ala Phe 
30 35 40 

TCC ATG AGA AAA GTT GCC AAC AGA GAA GCA ACA GAA ATT TCC CAT GTC 24 7 

Ser Met Arg Ly3 Val Pro Asn Arg Glu Ala Thr Glu lie Ser His Vai 
45 50 55 

CTA CTT TGC AAT GTA ACC CAG AGG GTA TCA TTC TGG TTT GTG GTT ACA 295 
Leu Leu Cys Asn Val Thr Gin Arg Val Ser Phe Tro Phe Val Val Thr 
60 65 70" 

GAC CCT TCA AAA AAT CAC ACC CTT CCT GCT GTT GAG GTG CAA TCA GCC 34 3 

Asp Pro Ser Lys Asn His Thr Leu Pro Ala Vai Giu Val Gin Ser Ala 
75 80 95 90 

ATA AGA ATG A-AC AAG AA.C CGG ATC AAC AAT GCC TTC TTT CTA AAT GAC 391 
lie Arg Mec Asn Lys Asn Arg He Asn Asn Ala Pha Phe Leu Asn i\so 
95 100 105 

C.H.A ACT CTG GAA TTT 7TA AAA ATC CCT TCC ACA CTT GCA CCA CCC AT3 .4 39 
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Gin Thr Leu 3iu ?ae Leu Lvs lie Pro Ser Thr Leu Ala Pro Pro Met 

'* no us 123 

GAC CCA TCT GTG CCC ATC TGG ATT ATT ATA TTT GGT GTG ATA TTT TGC 4 3? 

Asp Pro Ser Vai Pro lie Trp lie lie lie P!^e Gly Val lie Phe Cys 
125 • 130 " 135 

ATC ATC ATA GTT GCA ATT GCA CTA CTG ATT TTA TCA G3G ATC TGG CAA 535 
lie lie lie Val Ala lie Ala Leu Leu He Leu Ser Gly lie Trp Gin 
140 145 150 

CGT ADA ARA AAG AAC AAA GAA CCA TCT GAA GTG GAT GAC GCT GAA RAT 583 
Arg Xaa Xaa Lys Asn Lys Glu Pro Ser Giu Vai Asp Asp Aia Giu Xaa 
155 i60 165 " 170 

AAK TGT GAA AAC ATG ATC ACA ATT GAA AAT GGC ATC CCC TCT GAT CCC 631 
Xaa Cys Giu Asn Met lie Thr lie Giu Asn Gly lie Pro Ser Asp Pro 
175 180 185 

CTG GAC ATG AAG GGA GGG CAT ATT AAT GAT GCC TTC ATG ACA GAG GAT 67 9 

Leu Asp Met Lys Giy Gly His lie Asn Asp Aia Phe Met Thr Giu Asp 
190 195 200 

GAG AGG CTC ACC CCT CTC TGAAGGGCTG TTGTTCTGCT TCCTCAARAA 727 
Giu Arg Leu Thr Pro Leu 
205 

ATTAAACATT TGTTTCTGTG TGACTGCTGA GCATCCTGAA ATACCAAGAG CAGATCATAT 787 

WTTTTGTTTC ACCATTCTTC TTTTGTAATA AATTTTGAAT GTGCTTGAAA AAAAAAAAAA 847 

C 848 



■:2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 1 . . 14 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: ccore 10.7 

sec LWLLFFLVTAIHA/EL 

(xi) SEQUENCE DESCRIPTION: SEQ 10 MO: 23: 



X-et Leu Trp Leu Leu Phe Phe Leu Val Thr Aid lie His Aia 

: ' 5 10 
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(2) IMFOi^JATIOM FOR SZQ ID NO: 29: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 
(3) TYPE: NUCLEIC ACID 
(C: 3TH.2lNDEDNESS: SINGLE 
(D) TOPOLOGV: LINEAR 

(11) K.OLECULE TYPE: Other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GGGAAGATGG AGATAGTATT GCCTG 25 



(2) INTORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

{B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 
(xil SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CTGCCATGTA CATGATAGAG AGATTC 26 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHAKACTERISTICS: 

(A) LENGTH: 546 base pairs 

(3) TYPE; NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Geaomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: promoter 
(3) LOCATION: 1..517 

(ix) FEATURE: 

(A) NAME/KEY: transcription start site 
(3) LOCATION: 518 

(IX) FEATURE: 

(A) NAME/KEY: TF binding-Site 
(3) LOCATION: 17.. 25 

(C) IDENTIFICATION METHOD: matinspector prediction 
(D; OTHER INFORMATION: name CMY3_01 

score 0.983 
sequence TGTCAGTTG 
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(ix: FEATURE: 

"(A) NAME/KEY: TF bindingrsite 
{B) LOCATION': complement ( 18 .. 27 ) 

(C) IDENTIFICATION METHOD: natinspector prediction 

(D) OTHER INFORMATION: name MY0D_Q6 

score 0.961 
sequence CCCAACTGAC 

(ix) FEATURE; 

(A) NAME/KEY: TF binding-site 
(3) LOCATION: comoiement (75 , . 85 } 

(C) IDENTIFICATION METHOD: macinsoector orediction 

(D) OTHER INFORMATION: name S8_oi 

score 0.960 
sequence AATAGAATTAG 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: 94 . . 104 

(C) IDENTIFICATION METHOD: matinspector prediction 
(0) OTHER INFORMATION: name S8_01 

score 0.966 
sequence A.«ICTAAATTAG 

(ix) FEATURE: ^ 
(A) NAME/KEY: TF binding-site »' 
(3) LOCATION: complement (129. . 139) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name DELTAEFl^Ol 

score 0.960 * 
sequence GCACACCTCAG 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(3) LOCATION: complement (155 165) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name GATA^C 

score 0.964 
sequence AGATAAATCCA 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 
(3) LOCATION: 170. .178 

(C) IDENTIFICATION METHOD: rratmspector prediction 

(D) OTHER INFORMATION: na.-ne CMY3_01 

score 0.958 
sequence CTTCAGTTG 

(ix) FEATURE: 

(A) NAME/KEY: T? bi.iding-site 
(3) LOCATION: 176.. 189 

(C) IDENTIFICATION METHOD: rr.atinspector prediction 

(D) OTHER INFORMATION: name 3ATA1_02 

score 0.959 

sequence TTGTAGATAGGACA 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-sire 
{3) LOCATION: 180.. 190 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name ZMPi C 
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score C.953 
sequence AGATAGGACAT 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: 234.. 299 

(C) IDENTIFICATION METHOD: mat inspector prediction 
{0) OTHER INFORMATION: name TAL1AL?HAE4"7_0 1 

score 0,973 

sequence CATAACAGATGGT.V^G 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: 284.. 299 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name TAL1BETAE47_01 

score D.983 

sequence CATAACAGATGGTAAG 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 
(3) LOCATION: 284 •.299 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name TAL1B£TAITF2_01 

score 0.973 

sequence CATAACAGATGGTAAG 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: complement (287 .. 296} 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name MY0D_Q6 

score 0.9S4 
sequence ACCATCTGTT 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: complement ( 302 314 ) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION! name GATA1_04 

score 0.953 

sequence TCAAGATAAAGTA 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: 393.. 405 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name IKlJol 

score 0.963 

sequence AGTTGGGAATTCC 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: 393. .404 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: na.Tie IK2joi 

score 0.995 

sequence AGTTGGGAATTC 



(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 
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:5) LOCATION: 396.. 405 

:C) IDENTIFICATION METHOD; nacir.spectcr predircion 

(C) OTHER IKsORMATION: name CREL^OI 

score 0.962 
sequence TGGGAATTCC 

;iv; FEATURZ: 

(A) NAME/KEY: IF binding-site 
{B: LOCATION: 423.. 436 

(C; IDEMTIcICATIO^^ METHOD: matinsoecr or prediction 
(D; OTHER INFORMATIOM: name GATAi_02 

score 0.950 

sequence TCAGTGATATGGCA 

FEATURE: 
(A) NAME/KEY: TF binding-site 
(3) LOCATIOM: complement ( 478 489 ) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name SRY_02 

score 0.951 

sequence TAAAACAAAACA 

( ix ) FEATURE : 

(A) NAME/KEt: TF binding-Site 
:B) LOCATION: 436.. 493 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name E2F_02 

score 0,957 
sequence TTTAGCGC 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: complement ( 514 521 ) 

(C) IDENTIFICATION METHOD: matinspector prediction 
<D) OTHER INFORM.ATION: name MZFl^Ol 

score 0.975 
sequence TGAGGGGA 

[y.D SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

TGAGTCfcAGT GTTACATGTC AGTTGGGTTA AGTTTGTTAA TGTCATTCAA ATCTTCTATG 60 

TCTTGATTTG CCTGCTA.ATT CTATTATTTC TGGA.ACTAAA TTAGTTTGAT GGTTCTATTA 120 

GTTATTG.ACT GAGGTGTCCT AATCTCCCAT TATGTGGATT TATCTATTTC TTCAGTTGTA 180 

G.ATAGGACAT TGATAGATAC ATAAGTACCA GGACAAA.AGC .AGGGAGATCT TTTTTCC^AA 24 0 

ATCAGGAG.AA AAAAATG.ACA TCTGGAAAAC CTAT.AGGGAA AGGCATAACA GATGGT.AAGG 300 

AT.ACTTT.ATC TTGAGTAGGA GAGCCTTCCT GTGGCAACGT GGAGAAGGCA AG.AGGTCGTA 360 

GAATTGAGGA GTCAGCTCAG TT.AGA-AGCAG GGAGTTGGGA ATTCCGTTCA TGTGATTTAG 420 

CATCAGTG.^.T ATGGCAJi.ATG TGGGACTAAG GGTAGTGATC .AGAGGGTTAA A.ATTGTGTGT 480 

TTTGTTTT.-.:« CGCZZ-Z^ZZZ GC.2.TCGCCTT GGGTCCCCTC .AAACAGATTC CCATG.AATCT 54 0 
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(2) INFORMATIOM FOR SEQ ID NO: 32: 

(1) SEQUENCE CHAE^CTERISTICS: 

(A) L€NGTH: 23 base pairs 
(3) TV?£: MUCLEIC Acic 

(C) STRANDEDNESS: SI^^GLS 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



GTACCAGGGA CT3TGACCAT TGC 



{2; INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 
<D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



CTGTGACCAT TGCTCCCAAG AGAG 



(2) INFOR.MATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Geaomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION: 1, .306 

(ix) FEATURE: 

(A) NA^!F./KEY: transcription start site 

(B) LOCATION: 807 

FEATURE: 
(A) NAME/KEY: TF binding-site 
(3) LOCATION: complement : 60 .. 70) 

(C) IDENTIFICATION METHOO: nat Inspector predic-ion 

(D) OTHER INFORMATION: name NFY_Q6 

score J. 956 
sequence GGACCAATCAT 



wo 99/06549 



23 



PCT/1B98/0I231 



(A) NAME/KEY: IF bincinc-sire 
(5) LOCATION: 70.. 77 

(C) IDENTiriCATIGN METHOD: matinspector precicriDn 

[D) OTHER INEORMATION: name MZFijDl 

score 0.962 
sequence CCTGGGGA 

ii>;: FEATURE: 

(A| NAME/KEY: T: bindiag-site 
(3) LOCATION: 124.. 132 

(C) IDENTIFICATION METHOD: matinspector precliccon 

(D) OTHER INFORMATION: name CMYB_01 

score 0.994 
sequence TGACCGTTG 

ax) FEATURE: 

(A) NAME/KEY: TF binding-site 

(3) LOCATION: complement ( 12 6 .. 134 ) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name VMYb[_02 

score 0.985 
sequence TCCAACGGT 

(ix) FE.ATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: 135.. 143 

(C) IDENTIFICATION METHOD: matinspector prediction 
CD) OTHER INFORMATION: name STAT^Ol 

score 0.968 
sequence TTCCTGGAA 

(ix} FELATURE: 

(A) NAME/KEY: TF binding-site 

(3) LOCATION: complement ( 135 .. 143 J 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name STAT_01 

score 0.951 
sequence TTCCAGGAA 

\i>:} FEATURE: 

(A) NAME/KEY: TF bindinq-5ite 

(3) LOCATION: complement ( 252 .. 259) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name M2F1_01 

score 0-956 
sequence TTGGGGGA 

(Ix) FEATURE: 

(A) NAME/KEY: TF binding-site 
(3) LOCATION: 357.. 368 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name IK2_01 

score 0.965 

sequence GAATGGGATTTC 

;i:<; FEATURE: 

(A: M.^-i£/KEY: TF binding-site 
(3; LOCATION: 384.. 391 

{C: IDEN'TIFICATION METHOD: matinspector predict ior. 
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" (0) OTHER IMFORMATICN: p.dire M2F1_01 

score 0.985 
sequence AGAGGGGA 

(ix) FEATURE: 

(A) NAME/KEY: TF binGing-3::te 

(B) LOCATION: complement ( 10 42 1 } 

(C) IDENTIFICATION METHOD: matinspector prediction 
:D) OTHER INFORMATION: name SRY_02 

score 0.955 

sequence GAAA.z^CAAAACA 

(ix) FEATURE: 

(A) NAME/KEY: T? binding-site 

(B) LOCATION: 592.. 599 

(C) IDENTIFICATION MET.KOC: matinspector prediction 

(D) OTHER INFORMATION: name MZFl^Ol 

score 0.960 
sequence GAAGGGGA 

(ix) FEATURE: 

(A) NA>1E/KEY: 7F bindirg-site 

(B) LOCATION: 613. .SZ"? 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name MY0D_Q6 

score 0.981 
sequence AGCATCTGCC 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: 632-. 642 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name DELTA£?1_01 

score 0.958 
sequence TCCCACCTTCC 

(ix) FEATURE: 

(A) NAME/KEY: TF bindinc-site 

(3) LOCATION: complement (8 i 3 .. 323) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name Sa^Ol 

score 0.992 
sequence GAGGCAATTAT 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: complement (82 4 8 31 ) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name M2F1_01 

score 0.9?6 
sequence AGAGGGGA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



TACT.ATAGGG CACGCGTGGT CGACGGCCGG GCTGTTCTGG AGCAGAGGGC ATGTCAGTAA 60 
CGGTGACCGT TGG.^i.TTCCTG GA.AGCAG7AG CTGTTCTGT? TGGATCTGGT .J^GGGAC.AGGG 180 
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CTCAGACCGC TAGGCACGAG GGAAGGTCAG AG3AGAACG3 AGG3ARGGCC CAGTGAGARG 2^.0 

G3AGCATGCC TTCCCCCAAC CCTG3CTTSC YCTTGGYt'J^M AGGGCGKTTY IGGGMACTTa 300 

,\-.YTrAGGGC CCAASCAGAA SCACAGGCCC AKTCKTGGCT SMAAGCACAA TAGCCTGAAT 350 

GGGATTTCAG GTTAGNCAGG G7GAGAGGGG AGGCTCTCTG GCTTAGTTTT GTTTTGTTTT ^20 

CCiJ^-^TCAnG GT.^.ACTTGCT CCCTTCTGCT ACGGGCCTTG GTCTTGGCTT GTCCTCACCC 4b 2 

AGTCGGAACT CCCTACCACT TTCAGGAGAG TGGT7TTAGG CCCGTGGGGC TGTTCTGTTC 540 

CAJ\GCAGTGT GAGAACATGG CTGGTAGAGG CTCTAGCTGT GTGCGGGGCC TGAAGGGGAG 600 

TGGGTTCTCG CCCAAAGAGC ATCTGCCCAT TTCCCACCTT CCCT7CTCCC ACCAGAAGC? 66C 

TGCCTGAGCT GTTTGGACAA AAATCCAAAC CCCACTTGGC TACTCTGGCC TGGCTTCAGC 720 

TTGGAACCCA ATACCTAGGC TTACAGGCCA TCCTGAGCCA GGGGCCTC7G GAAATTCTCT 780 

TCCTGATGGT CCTTTAGGTT TGGGCACAAA ATATAATTGC CTCTCCCCTC 7CCCATTTTC 840 

7C7C:TGGGA GCAA7GG7CA C 961 

(2) INr0RMA7I0N FOR SEQ ID NO: 35: 

(i) SSQUSNCE; CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

MOLECULE TYPE: Other nucleic acid 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CTGGGATGGA AGGCACGGTA 20 

(2) IMFORMATION FOR SEQ ID NO: 36: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 
(0) TOPOLOGY: LINEAR 

lii) MOLECULE TYPE: Other nucleic acid 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GAG.-.CCACAC AGCTAGACAA 2: 

:z; :ntc?.m^tiom ro5 seq id no: 37: 
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(i)"SECUENCE CHARACTERISTICS: 

(A) LEMGTH: 555 base pairs 
{B} TYPE: NUCLEIC ACID 
(C! 3TRANDEDNESS : DOUBLE 
(D) TOPOLOGY: LINEA.R 

(li) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: promoter 
(3) LOCATION: 1 . . 500 

(ix) FEATURE: 

(A) NAME/KEY: transcription start sice 
(3) LOCATION: 501 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 
:3) LOCATION: 191.. 206 

:C) IDENTIFICATION METHOD: matinspector prediction 
vD) OTHER INFORMATION: name ARNT_01 

score 0.964 

sequence GGACTCACGTGCTGCT 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: 193.. 204 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name NMYC^Ol 

score 0.965 

sequence ACTCACGTGCTG 

(ix) FEATURE: 

(A) NAi>4E/KEY: TF binding-site 
(3) LOCATION: 193.. 204 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name USfJoI 

score 0.995 

sequence ACTCACGTGCTG 

fix) FEATURE: 

(A) NAME/KEY: TF bind:Lng-3ite 

(31 LOCATION: complement (193. .204) 

(C) IDENTIFICATION METHOD: .natinspector prediction 

(D) OTHER INFORMATION: name U3F_01 

score 0.995 

sequence CAGCACGTGAGT 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: complement ( 193 204 ) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name NMYC_01 

score 0.956 

sequence CAGCACGTGAGT 

(ixJ FE.^TURE: 

(A) NAJyiE/KEY: TF binding-site 

(3) LOCATION: corr.piement ( 1 93 . . 204 ) 

(C) IDENTIFICATION METHOD: mat in.spect.'Jr predirticn 
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(D) OTHER INFORKQlTION; name MYCMAX_02 

score 0.972 
sequence CAGCACGTGAGT 

(i:<) FEATURE: 

(A; NAWE/KEY: TF bir.dina-site 
(3) LOCATION: 195.. 202 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION:' name USF_C 

score C.997 
sequence TCACGTGC 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: complement ( 195 202 : 

■C) IDENTIFICATION METHOD: matmspectcr prediction 

{D) OTHER INFORMATION; name USF_C 

score 0.991 
sequence GCACGTGA 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(3) LOCATION: complement (210 .. 217 ) 

(C) IDENTIFICATION METHOD: matinsoector orediction 

(D) OTHER INFORMATION: name MZFl^Ol 

score 0.96S 
sequence CATGGGGA 

(ix) FEATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: 397.. 4 10 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name ELK1_02 

3core 0.963 

sequence CTCTCCGGAAGCCT 

(ix) FE.ATURE: 

(A) NAME/KEY: TF binding-site 
(3) LOCATION: 400.. 409 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name CETSl?54_0i 

score C-974 
sequence TCCGGAAGCC 

(ix) FE.ATURE: 

(A) NAME/KEY: TF binding-site 

(B) LOCATION: complement ( 4 60 .. 470 ) 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name AP1_Q4 

score 0.963 
sequence AGTGACTGAAC 

(IX) FE.i.TURE: 

(A) NAME/KEY: TF binding-site 

(3) L(XATION: Complement ( 4 60 .. 470 i 

(C) IDENTIFICATION METHOD: matinspector prediction 

(D) OTHER INFORMATION: name AP1FJ_Q2 

score C.96T 
sequence AGTGACTGAAC 
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-- (A; MAI'IE/KEY: TF bi-dirg-site 
{3} LOCATION: 547 . . 555 

iC) IDiNTIFICATIOM METHOD: nia-inspector oreliccicn 

(D) OTHER INFORMATION: name PADS^C 

score 1.000 
sequence TGTGGTCTC 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CTATAGGGCA CGCKTGGTCG ACGGCCCGGG CTGGTCTGGT CTGTKGTGGA GTCGGGTTCA 6: 

AGGACAGCAT TTGTKACATC TGGTCTACTG CACCTTCCCT CTGCCGTGCA CTTGGCCTTT 123 

KAWAAGCTCA GCACCGGTGC CCATCACAGG GCCGGCAGCA CACACATCCC ATTACTCAGA 150 

AGGA.ACTGAC GGACTCACGT GCTGCTCCGT CCCCATGAGC TCAGTGGACC TGTCTATGTA 24 0 

GAGCAGTCAG ACAGTGCCTG GGATAGAGTG AGAGTTCAGC CAGTAAATCC A^GTGATTGT 3CD 

CATTCCTGTC TGCATTAGTA ACTCCCAACC TAGATGTGAA AACTTAGTTC TTTCTCATAG 360 

GTTGCTCTGC CCATGGTCCC ACTGCAGACC CAGGCACTCT CCGGAAGCCT GGAAATCACC 42 3 

CGTGTCTTCT GCCTGCTCCC GCTCACATCC CACACTTGTG TTCAGTCACT GAGTTACAGA 480 

TTTTGCCTCC TCAATTTC7C TTGTCTTAGT CCCATCCTCT GTTCCCCTGG CCAGTTTGTC 540 

TAGCTGTGTG GTCTC 555 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 64 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/ KEY; sig_peptide 
(9) LOCATION: 90. .179 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
O) OTHER INFORMATION: score 13.2 

seq LLLLSTLVIP5AA/AP 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



.-J^ACAGTAC GTGGGCGGCC GGAATCCGGG AGTCCGGTGA CCCGGGCTGT GGTCTAGCA? 6 3 

AAAGGCGGAG CCAGAAGAAG GGGCGGGGT .^TG GGA GAA GCC TCC CCA CC7 GCC 113 

Met Giy Glu Ala Ser Pre Pro Ala 
-30 -25 
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CCC GCA A33 CGG CAT CTG CTG GTC CTG CTG CTG CTC CTC TCT ACC C7G 161 
Pro Aia Arg Arg His Leu Leu Vai Leu Leu Leu Leu Leu Ser Thr Leu 
-20 -15 -iC 

GTG ATC CC" TCC GCT GCA GC? CCT ATC CAT GAT GCT GAC GCC CAA GAG 209 
Vdi He rrD Ser Aid Aia Aia Pre lie His Asp Aia Asp Aia Gin Giu 
-5 1 5 10 

A3C TCC TT-3 GGT C?C ACA GGC CTC CAG AGC CTA CTC CAA GGC TTC AGC 257 
Ser Ser Leu Giy Leu Thr Giy Leu Gin Ser Leu Leu Gin Giy Phe Ser 
15 20 25 

CGA CTT TTC CTG AAA GGT AAC CTG CTT CGG GGC ATA GAC AGC TTA TTC 305 
Arg Leu Phe Leu Lys Giy Asa Leu Leu Arg Giy lie Asp Ser Leu Phe 
30 35 40 

TCT GCC CCC ATG GAC TTC CGG GGC CTC CCT GGG AAC TAC CAC AAA GAG 353 
Ser Aia Pro Met Asp Phe Arg Giy Leu Pro Giy Asn Tyr His Lys Giu 

45 50 55 

GAG AAC CAG GAG CAC CAG CTG GGG AAC AAC ACC CTC TCC AGC MAC CTC 401 
Giu Asn Gin Giu His Gin Leu Giy Asn Asn Thr Leu Ser Ser Xaa Leu 
60 65 70 

CAG ATC GAC MNG ATG ACC GAC AAC AAG ACA GGA GAG GTG CTG ATC TCC 449 
Gin He Asp Xaa Met Thr Asp Asn Lys Thr Giy Giu Vai Leu lie Ser 
75 80 85 90 

GAG AAT GTG GTG GCA 464 
Giu Asn Vai Vai Ala 
95 



(2) INFORi'IATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii} MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) N.2\M£/KEY: sig^peptide 

(B) LOCATION: 56,. 118 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 12 

sec VLVLCyLLLQAQG/GY 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 39: 



AAGGGAAGT: TGTGACTGCC TGGCCAGACT TA3GGCTCAC GCTCTGGTCA GAGTT ATG 53 

Mec 
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GCA CCC CAG ACT CTG CTG CCT GTC CTG GT? CTC TGT 3TG CTG CTG CTG 106 
Ala Pro Gin Thr Leu Leu Pro Val Leu Vai Leu Cvs Vai Leu Leu Leu 
-2C -15 -10 ' -b 

CAG GCC CAG GGA GGA TAG CGT GAC AAG ATG AGG ATG CAG AGA ATC AAG 154 
Glr. Aia Gin Gly Giy Tyr Arg Asp Lys Met Arg Met Gin Arg lie Lys 

1 5 :c . 

GTC TGT GAG AAG CGA CCC AGO ATA GAT CTA TGC ATC CAC CAC AGG 199 
Val Cys Glu Lys Arg Pro Ser lie Asp Leu Cys lie His His Arg 
15 20 25 



(2) INFO[IMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 9 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(li) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 47,. 103 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 11 

seq SLVLLLCLTCSYA/FM 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



AAAGCAAACC CGTCATGAGC AACTCCCTTC CCCATCTCTG TTCACC ATG TGG ACG 55 

Met Trp Thr 

CTG AAA TCG TCC CTG GTC CTG CTT CTG TGC CTC ACC TGC AGC TAT GCC 103 

Leu Lys Ser Ser Leu Val Leu Leu Leu Cys Leu Tnr Cys Ser Tyr Ala 

-15 -10 -5 

TTT ATG TTC TCT TCT CTG AGA CAG AAA ACT AGC GAA CCC CAG GGG AAG 151 

Phe Met Phe Ser Ser Leu Arg Gin Lys Thr Ser Glu Pro Gin Gly Lys 

1 5 10 15 

GTG CAA TAC GGA GAG CAC TTT CGG ATT CGG CPvG A.AT CTA CCA GAG CAC 199 

Vai Gin Tyr Gly Glu His Phe A.rg lie Arg Gin Asn Leu Pro Glu His 

20 25 30 

ACC CAA ZZZ TGG CTT GGG AGC AAA TGG CTC TGG CTT CTT KTT GTT GTT 24 7 

Thr Gin Gly Trp Leu Gly Ser Lys Tro Leu Tro Leu Leu Xaa Vai Vai 

35 40 * " 45 

GTG CCG TTT GTG ATA CTG CAG TCT CAA AGA GAC AGT GAG AAG AAT AAG 295 

Vai Pro Pre Val He Leu Gin Cys Gin Arg Asp Ser Glu Lys Asn Lys 

50 55 50 
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GAG CAG Pr3T CCT CCT 3GC CTT CGA GGC GGC CAA CTT CAC TCT CCA TTA 34 3 
Glu Gin Ser Pro ?rc 31y L=u Ar^ Giy Giy Gin Leu His Ser Pro Leu 
65 70 75 80 

AAG AAA 34 9 

Lys Lys 



(2) INFORMATION FOR S£Q ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(rj TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 70. .il7 

(C) IDENTIFICATION METHOD: Von Hsijne matrix 

(D) OTHER INFORMATION: score 10.6 

seq LLLLPLLWGGSLQ/EK 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



AAATTTTGGA GCATTTCCTT CCCTGACAGC CGGACCTGGG ACTGGGCTGG GGCCCTGGCC 60 

GATGGAGAC ATG CTG CCC CTG CTG CTG CTG CCC CTG CTG TGG GGG GGG TCC 111 
Met Leu Pro Leu Leu Leu Leu Pro Leu Leu Trp Giy Giy Ser 
-15 -10 -5 

CTG CAG GAG AAG CCA GTG TAC GAG CTG CAA GTG CAG AAG TCG GTG ACG 159 
Leu Gin Giu Lys Pro Vai Tyr Giu Leu Gin Val Gin Lys Ser Vai Thr 
1 S 10 

GTG CAG GAG GGC CTG TGC GTC CTT GTG CCC TGC TCC TTC TCT TAC CCC 207 
Val Gin Giu Giy Leu Cys Val Leu Val Pro Cys Ser Phe Ser Tyr Pro 
15 20 25 30 

TGG AGA TCC TGG TAT TCC TCT CCC CCA CTC TAC GTC TAC TGG TTC CGC 25S 
Trp Arg Ser Trp Tyr Ser Ser Pro Pro Leu Tyr Val Tyr Trp Phe Arg 
35 40 ' 45 

GAC GGG GAG ATC CCA TAC TAC GCT GAG GTT GTG GCC ACA AAC AAC CCA 303 
Asp Giy Glu lie Pro Tyr Tyr Ala Glu Val Val Ala Thr Asn Asn Pro 
50 55 60 

GAC AGA a: A GTG A-AG CCA GAG ACC CAG GGC CGA TTC CGC CTC CTT GGG 351 
Asp Arg Arg Val Lys Pre Giu Thr Gin Giy Arq Phe Arg Leu Leu Giy 
65 70 75 

GAT GTC A.-.G AAG .A.AC TGC TCT CTG AGC ATC.GGA GAT SCC AGA ATG 399 
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Asp Vai Gir. Lys Lys Asn Cys Ser Leu Ser lie G-y Asp Xaa Arg Met 

30 " 8 5 90 

GAG GAG AC3 GGC GGG 414 
Giu Asp Thr Gly Giy 
95 ' • 



(V-: I^J^ORC-lnTION FOR SEQ ID NO: 42: 

;i) SS0UENC2 CHARACTERISTICS: 

(A) LENGTH: 215 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) t-IOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE:. 

(A) ORGANISM: Homo Sapiens 
{:) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 24.. 101 

(C) IDENTIFICATION METHOD: Von Heijne marrix 

(D) OTHER INFORMATION: score 10,4 

seq LLLLLCGPSQDQC/R? 

{xD SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



AANCCACCTG CSGCCCGCCA GCC ATG GAG ACT GGA GCG CTG CGG CGC CCG CAA 53 

Met Giu Thr Gly Aia Leu Arg Arg Pro Gin 
-25 -20 

CTT CTC CCG TTG CTG CTG CTG CTC TGC GGC CCT TCC CAG GAT CAA TGC 101 
Leu Leu Pro Leu Leu Leu Leu Leu Cys Giy Pro Ser Gin Asp Gin Cys 
-15 -iO -5 

CGA CCT GTA CTC CAG AAT CTG TTG CAG AGC CCA GGC TTG ACA TGG AGC 14 9 
Arg Pro Vai Leu Gin Asn Leu Leu Gin Ser Pro Giy Leu Thr Trp Ser 
i 5 iG 15 

TTG GAA GTG CCC ACT GGG AGA GAA GGA AAG GAA GGT ACT ATG AGA GTT 197 
Leu Giu Vai Pro Thr Gly Arg Giu Gly Ly5 Giu Giy Thr Met Arg Vai 
20 25 30 

TCA CCA ACT GCA CCA AGG 215 
Ser Pro Thr Aia Pro Ara 
35 



(2 J INFOR^UATI0M FOR SEQ ID NO: 43: 

•i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 
(3) TYPE: NUCLEIC ACID 
iC) STRANDEDMESS: DOUBLE 
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(D) TOPOLOGY: LINEAR 

{ii) MOLECULE TYPE: CDiNA 

{VI) ORIGINAL SOURCE: 

(A) ORGANISM: Hcmo Sapiens 
(F: tissue TYPE: Testis 

(i;<) FEATURE:' 

(A) NAME/KEY: si(;_peptide 
(3) LOCATION: 4 9. . 96 

(C) IDENTIFICATIOM METHOD: Von Heijne natrix 

(D) OTHER INFORMATION: score 10.1 

seq LVLTLCTLPLAVA/SA 

(XI > SLQUENCt ULSCRIPTION: SEQ ID NO: 43: 



AAAGATCCC7 GCAGCCCGGC AGGAGAGAAG GCTGAGCCTT CTGGCGTC ATG GAG AGG 5? 

Met Giu Arg 
-15 

CTC GTC CTA ACC CTG TGC ACC CTC CCG CTG GCT GTG GCG TCT GCT GGC 1C5 
Leu Val Leu Thr Leu Cys Thr Leu Pro Leu Ala Vai Ala Ser Ala Gly 
-10 -5 1 

TGC GCC ACG ACG CCA GCT CGC AAC CTG AGC TGC TAC CAG TGC TTC AAG 153 
Cys Ala Thr Thr Pro Aid Al:^ Asn Leu Ser Cys Tyr Gin Cys Phe Lys 
5 10 15 

GTC AGC AGC TGG ACG GAG TGC CCG CCC ACC TGG TGC AGC CCG CTG GAC 201 
Val Ser Ser Trp Thr Glu Cys Pro Pro Thr Trp Cys Ser Pro Leu Asp 
20 25 30 35 

CAA GTC TGC ATC TCC AAC GAG GTG GTC GTC TCT TTT AAA TGG AGT GTA 24 9 
Gin Val Cys lie Ser Asn Giu Val Vai Val Ser Phe Lys Trp Ser Val 
40 45 50 

CGC GTC CTG CTC AGC AAA CGC TGT GCT CCC AGA TGT CCC AAC TCA GGG 297 
Arg Val Leu Leu Ser Lys Arg Cys Ala Pro Arg Cys Pro Asn Scr Gly 
55 60 65 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CONA 

(vi' ORIGINAL SOURCE: 

(A) ORGANISM: Hcmo Sapiens 
(F) TISSUE TYPE: Testis 

(ix: FEATURE: 

(A) NAME/KEY: sig^peptide 
(31 LOCATION: 62.7l30 
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tC« IDENTIFICATIOM H£THO0: Von Heiine natrix 
(0) OTHEIR INTORtlATIOM: score 9.3 ' 

seq FLLFFFLFLLTRG/SL 

(>;i) sequence: DESCRIPTION: SEQ ID NO: 44: 



ACATGG7CGG YGTGCAGGAT ATTTCGCTGG ACCCTAGAAA AGCCACCA3G ACCTGTGGGC 60 

C ATG ATG CTA CCC CAA TGG CTG CTG CTG CTG TTC CTT CTC TTC T?C TTT 1Q9 
Met Mec Leu ?ro Gin Trp Leu Leu Leu Leu Phe Leu Leu ?he Phe Phe 
-20 -15 -10 

CTC TTC CTC CTC ACC AGG GGC TCA CTT TCT CCA ACA AAA TAC AA.C CTT 157 
Leu Phe Leu Leu Thr Arg Gly Ser Leu Ser Pro Thr Lys Tyr Asa Leu 
-5 15 

TTG GAG CTC AAG GAG KSK KGC ATS GGG AAC CAG GAC TGC GAG ACT GGC 205 
Leu Giu Leu Lys Glu Xaa Xda Xaa Gly Asn Gin As? Cys Glu Thr Gly 
10 15 20 25 

TGC TGC CAA CGT GCT CCA GAC AAT TGC GAG TCG CAC TGC GCG GAG AAG 253 
Cys Cys Gin Arg Ala Pro Asp Asn Cys Giu Ser His Cys Ala Glu Lys 
30 35 40 

GGG TCC GAG GGC AGT CTG TGT CAA ACG CAG GTG TTC TTT GGC CAA TAT 301 
Gly Ser Glu Gly Ser Leu Cys Gin Thr Gin Val Phe Phe Gly Gin Tyr 
45 50 55 

AG A GCG TGT CCC TGC CTG CGG AAC CTG ACT TGT ATA TAT TCA AAG AAT 349 
Arg Ala Cys Pro Cys Leu Arg Asn Leu Thr Cys lie Tyr Ser Lys Asn 
60 65 70 

GAC AAA TGG CTT AGC ATC GCC TAT GGC CGT TGT CAG AAA ATT CCA AGC 397 
Giu Lys Trp Leu Ser lie Ala Tyr Gly Arg Cys Gin Lys lie Gly Arg 
75 80 85 

CAG AAG TTG GCT AGR AAA TGT TCT 421 
Gin Lys Leu Ala Arg Lys Cys Ser 
90 95 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii^ MOLECULE TYPE: CDNA 

{vi} ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
it) TISSUE TYPE: Testis 

(i;:} FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 63.. 133 

{O IDENTIFICATION METHOD: Vcr. Hsijne natri:< 
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(D; other IN?ORt^A?ION: score 9.6 

seq LVVFCLALQLVP3/3? 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



.^AACAGCAGT GCCTGGTCAA ACCCAGCAAC CCTTGGCCAG AACTTACTCA CCCATCCCAC 6C 

7GACACC ATG AAG CCT GTG CTG OCT CTC CAG TWC CTG GTG GTG TTC TGC 109 
Met Lys Pro Val Leu Pro Leu Gin Xaa Leu Val Val Fhe Cys 
-20 -15 -10 

CTA GCA CTG CAG CTG GTG CCT GGG AGT CCC AAG CAG CTA GGG 151 
Leu Aia Leu Gin Leu Val Pro Gly Ser Pro Lys Gin Leu Giy 
-5 1 5 



(2) I«rORil^TION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

{ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spieen 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 134.. 238 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.5 

seq LFFSLFSAPLASA/VR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



AAACAGATAC TCCCAGCACA TGTTCCAWAG CAGCCCCCTG ATCCAATTTT CCTTAGCACG 60 

TAGGCTCAAG ACAATGCCCC ACTTCCCAAA GGCCTTGTGG CAATGTCCTC TTTTTCT7TC 120 

ACATATATGA TTT ATG TTC CGT CAA CCA CAG GA.A ACT GCT CAA AGA TCC 169 
Met Phe Arg Gin Arg Gin Gia Thr Aia Gin Arg Ser 
-35 -30 -25 

ACC C.i.G TCC TGC CGC TGC CCC CGT GAT GGT TTG TTT TTC TCA TTG TTT 217 
T.^.r Gin Ser Cys Arg Cys Pro Arg Asp Gly Leu Phe Phe Ser Leu P.ne 
-20 -15 -10 

AGC GCT CCA TTA GCT TCC GCA GTG AGA GCC! GCC ASG 253 
Ser Ala ?ro Leu Aia Ser Ala Vai Arg Aia Aia Xaa 
-5 1 5 



:>:F0R>LATI0N for seq id NO: 47: 
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ii: SEQUENCE CHAR.0.CTERI3TICS: 

(A) LENGTH: 124 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi; ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/ KEY: sig_pepride 

(B) LOCATION: 14 . . 91 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.4 

seq RLLLALPLALVLG/FE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



AGTACCACAG GCA ATG GGG TCA AGT GCC TGT GAA ATA GCT GTC GGG ACT 4 9 

Met Giy Ser Ser Ala Cys Glu lie Ala Val Gly Thr 
-25 -20 -15 

AAA AGG TTA TTA TTA GCT CTG OCT CTC GCT CTT GTT CTG GGC TTT GAA 97 
Lys Arg Leu Leu Leu Ala Leu Pro Leu Ala Leu Val Leu Gly ?he Glu 
-10 -5 1 

GGC TCA TCA GTT CCC CCA AGA AAT TTT 124 
Gly Ser Ser Val Pro Pro Arg Asn Phe 
5 10 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F> TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAa^JE/KEY: sig^peptide 

(B) LOCATION: 186.. 254 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: score 9.4 

sec SLLFICFFGESFC/IC 

(xi) SEQUENCE DESCRIPTION: SEQ 10 I'-O: 48: 
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AATATTTT3C TGACTGGCAA GGTTATATGA AGTGCT7TTA TTGAAGCArC ATTT7/\^CTA 60 

A7AGCTCCT3 GTATTTTCTG CTTCCCTTCG TA3GGAATTT AGTTATTTTA T7TTATTATT 120 

TAGC7A.'^7TT AGCTA7TTTA AAATAGC7AA AT77TAGCTA CTTTTT7TTC .^^7TGAC.W. 150 

Ct.a.:^.GG A7G TCT AAT CAA AGA CTA CCG CTG ATT TTT TCT CTG TTG TTT A7C 230 
Het Ser Asn Gin Arg Leu Pro leu lie Phe Ser Leu Leu ?ne lie 
-20 -15 -10 

TGC TTC T7C GGG GAG AGT TTC TGC ATT TGT GAT GGA ACT GTC TGG ACA 273 
Cvs Phe Phe Giv Giu Ser Phe Cys lie Cvs Asp Gly Thr Va. Trc Thr 
-5 1 ' 5 ' 

WWG GTT KRA TGG GAG ATT CTT CCA GAA GAA GTA CAT TAT TGG AAA GTT 325 
Xaa Val Xaa Trp Giu lie Leu Pro Giu Giu Vai His Tyr Trp Lys Vai 
10 15 20 

AAG GGT TCT CCA TCT CAC TGC CTG CGG 353 
Lys Gly Ser Pro Ser His Cys Leu Arg 
25 30 



{2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANOEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptice 
(9) LOCATION: 108.. 155 

(C) IDENTIFICATION METHOD: Von Hei^ne matrix 

(D) OTHER INFORMATION: score 9.2 

5eq FLSFLLALLSLNC/IP 

(xi) SEQUENCE DESCRIPTION: SEQ ID N'O: 4 9: 



ACATCGAGTT TACAAAATTT ATATTATCAT GAAATACTTC AATA6AGGGT TGGGAAATCT 60 

A2^.CTCTGGAG GAAATGCC.AC A.BiATTTCCAC TGCTGGGGTT TTTGAAG ATG C7C TGG 116 

Met Leu Trp 



7TC CTA 7CT TTT CTT CTA GCT CTC CTT TCC CTC AAT TGT .^.TC CCC ATC 164 
Phe Leu Ser Phe Leu Leu Ala Leu Leu Ser Leu Asn Cys lie Pro He 
-10 -5 ^ 1 



157 
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(2; INFORMATION FOR SEQ ID NO: SO: 

(i) SEQUSNCZ CHARACTERISTICS: 

(A) LEMGTH: 203 base pairs 
(3) TYPE; NUCLEIC ACID 
(C) STRANDEDNESS: DOUBLE 
(D; TOPOLOGY: LIiNEAR 

(il) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(i:<) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 84.. 155 

(C) IDENTIFICATION METHOD: Von Heijne marrix 

(D) OTHER INFORMATION: score 9.2 

seq ICCVIVLISLSWT/S? 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



ATGTKCGAAT TTATTGCTGA GACTTTCCAG TGCATTTTGC ATCTCTCTGA GTGTGTCCTT 60 

GATTTCCAAA AGTTGTTTTA TTT ATG CTG TKT ATT TCA CTC GAG ATT KTT TCC 113 

Met Leu Xaa lie Ser Leu Giu lie Xaa Ser 
-20 -15 

TTC ATA TGC TGT GTC ATT GTT TTG ATT TCT TTA AGT TGG ACT TCA OCT 151 
Phe lie Cvs Cys Val lie Val Leu lie Ser Leu Ser Trp Ttir Ser Pro 
-10 -5 1 

TTC ACT GGT GTG TAC TTG ATT GGT TTA ATA ATC GAG CCA GGG 203 
^ba Thr Giy Val Tyr Leu lie Gly Leu lie He Glu Pro Gly 
5 10 15 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 266 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?} TISSUE TYPE: Ovary 

(ix) FEATURE: 

( ;•. :» NA>!E / :<F. Y : <; i ff ^pppr i de 
(5) LOCATION: 183.. 239 

:Z) IDENTIFICATION METHOD: Von Heijne matri:'. 
:Zt 0T.-:ER information: scor.* 9.2 
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seq IliFZlTFrSHTrC/SR 
ixi) SEQUENCE DESCRIPTION: SEQ ID KO: 51: 

AATTTCACTG ATGTC7AGCT GTGGCTCTCT TTTTATACCT CCTATTTA.^lT ACCACA7GG7 6C 

CTTTGAAACC TGGAGACT7A CTGATTTCTT GAGCTCTAGT AAATGTTCTT 7TCTCA7TTA 120 

A77GATCA77 TTCTCCCATT TGTTGTC7CC TTACATCCCC AGGGCATTAC 7ATTTTGTAG 130 

CT A7G GTA T7C AGG AAC TGC ATT 77A T7T ATT TTA ACT T7T TTT TCT 227 
Met Val Fhe Arg Asn Cys lie Leu Phe lie Leu Thr Phe Phe Ser 
-15 -10 -5 

CAT ACT TTC TGT AGT AGG CAG AAT AAA GCC CAG CCC TGG 266 
His Thr Phe Cys Ser Arg Gin Asn Lys Ala Gin Pro Trp 
1 5 



(2) IJJrOBMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 
<D) TOPOLOGY: LINEAR 

Cii) MOLECULE TYPE: CDNA 

{vi) ORIGINAL SOURCE: 

(A} ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 7.. 45 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.1 

seq MLAACPLSPGCQS/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



GACGGC ATG CTG GCT GCG TGT CCC CTC 7CA CCA CCT TGC CAA AGC GCT 4 9 

Met Leu Ala Ala Cys Pro Leu Ser Pro Gly Cys Gin Ser Ala 
-10 -5 1 

CCA TCA ACG TGG AAT CAT TTT CCT CCT GAA AGA ATA ACC ACT GGA GCC 96 
Pro Ser Thr Trp Asn His Phe Pro Pro Giu Arg He Thr Thr Gly Ala 
5 10 15 



GGC AGC CTT CTG AAA CCA GGG GGT GGC CTC TGG CCA CGC ACA GTC TCT 
Gly Si^r T.eii T.ao Lys Pro Gly Gly Gly Leu Trp Pre Arg Thr Val Ser 
20 25 30 



144 
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(2; INF0R£4ATI0iM FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 base pairs 

(B) TYPE: NUCLEIC ACID 

( C ) STRANDEDMESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) MAME/KEY: sig_peptide 
(3) LOCATION: 43.. 99 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.1 

seq FLTLITHCTVSWA/QS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



AAGCTGCGGG TAGAG.AAGAC AGGACTCAGG ACA^TCTCCA GC .=-.TG GCC TGG TCC 5 4 

Met Ala Trp Ser 

CCT CTC TTC CTC .^VCC CTC ATC ACT CAC TGT ACA GTG TCC TGG GCC CAG 102 
Pro Leu Phe Leu Thr Leu lie Thr His Cys Thr Val Ser Tro Ala Gin 
-15 -10 -5 ' 1 

TCT GTT CTG ACT CAG CCA CCC TCG GTG TCT GAA GCC CCC AGA CAG AGG 150 
Ser Val Leu Thr Gin Pro Pro Ser Val Ser Giu Aia Pro Arg Gin Arg 
5 10 IS 

GTC ACC ATC TCC TGT TTT GGA AGC AGC TCC AAT ATC GGA CGA PAT GCT 193 
Val Thr lie Ser Cys Phe Gly Ser Ser Ser Asn lie Giy Arg Asn Ala 
20 25 30 

GTA PAC TGG TAT CAG CAA CTC CCA GGA AGG TCT CCC AGA CTT CTC ATT 24 6 
Val Asn Trp Tyr Gin Gin Leu Pro Gly Arg Ser Pro Arg Leu Leu lie 
35 40 45 

TTT TAT PJKT AAT CTC CCG GCA TCG 270 
Phe Tyr Asn Asn Leu Pro Ala Ser 
50 55 



(2) INFORMATION FOR SEQ ID NO: 54: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 
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(vli ORIGINAL source:: 

(A; ORGANISM: Homo Sapiens 
(7) TTSSUE TYPFl: Tpscis 



(ix) FEATURE : 

(A) NA^-ri/KEY: sig^pepCide 
(3) LOCATIOM: 49.. 102 

iZ) IDENTIFICATION METHOD: Von Hei:ne macrix 
iZ) OTHER INFORMATION: score 9.1 

seq LVSLCSWSPPLTS/S? 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO; 54: 



ACAGGCATAG ATCTAGCCCC ACCATCA-AGA CA.sjs^CAACAT TTTCTATT ATG TTA AA.A 5? 

Met Leu Lys 

A3T GTC CT? GTA AGC CTT TGC AGT TGG TCT CCT CCC CTG ACT TCC AGC 105 
3er Val Leu Vai 3er Leu Cvs Ser Trp Ser Pro Pro Leu Thr Ser Ser 
-15 -10 ' ' -5 1 

CCC AGG 111 
Pro Arq 

m' 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

.{ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 1547.219 

(C) IDENTIFICATION METHOD: Vcr. Heijne matrix 

(D) OTHER INFORMATION: score 9 

seq FILAALSLSTTF3/LQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



AA.KACCCTCC CTCCCGTTGC TCCAAACTAA TACGGACTGA ACGGATCGCT GCGAGGGTGG 60 

GAGAGAAAAT TAGGGGGAGA AAGGACAGAK AGAKCAACTA CCATCCATAG CCAGATAGAT 12C 

TATCTTACAC TGA-ACTGATC AA3TACTKTG AAA ATG ACT TCG AAA TTN ATC TTG 1"?4 

Mei Thr Ser Lys Xaa lie Leu 
-20 



TCC 7TC ATA CTT GCT 3CA CTG AGT CTT TCA ACC 
Ser Pr.e He Leu Ala Ala Leu Ser Leu Ser- Thr 
-10 -5 



=iCC TTT TCT CTC 
rhr ?he Ser Leu 



222 
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c.^A CCA ta: cac car aag gtt cta cta gtt tot ttt gat 3GA TTc err 

Gin Pro Tyr Gin Gin Lys Vai Leu Leu Vai Ser ?he Asp Giy The Arg 
5 10 1 - 



•^0 



TGG GAT TAC TTA TAT 
Trp Asp Tyr Leu Tyr 
2D 



235 



(2) INFO?^>--i.T:OM FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

{D; TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: Other 
(3) LOCATION: 35.. 120 

(C) IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identity 100 

region 44 . . 79 
id AA280744 
est 

(ix; FEATURE: 

I A) NAME/KEY: sig^peptide 

(B) LOCATION: 52.. Ill 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.9 

seq LAVXLGLATAVSA/GP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



GAAAGTGAAA. GGAGGAAGAG GAGGCTAAAT GGCTGAGGAG GTCGCAGCGC C ATG AAG 57 

Met Lys 
-20 

TCC CTG TCT CTV MTC CTM GCT GTG GMT TTG GGC CTG GCG ACC GCC GTC 105 
Ser Leu Ser Leu Xaa Leu Ala Vai Xaa Leu Gly Leu Ala Thr Aia Vai 
-15 -10 -5 

TCA GCA GGA CCC GCG TGG 123 
Ser Ala Gly Pro Ala Trp 



:2) IMFO?:*--.TION FOR SEQ ID NO: 57: 
ii: 3£QU£MCZ CHARACTERISTICS: 
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LENGTH: 345 bass pairs 
|3) TYPE!: NUCLEIC ACID 
(C> STRANDEDNES5: DOUBLE 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(i.x) FEATURE: 

(A) NAT-IE/KEY: sig^peptide 
(3) LOCATION: iOs/.TeS 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.8 

seq LLWALLFMQSLWP/QL 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 57: 



AAAGATACTG ACTG.^U^CATG GCTGGCGGAC TCAGGCTGGG GTCTGCAGTG CAGCATTA^iT 50 

GGGCCGCTGA CATGAATATG GAGTAGTTTT CTCTAGCAAA GAGTA ATG TGG GCC ATG 117 

Met Trp Ala Met 
-20 

GAG TCA GGC CAC CTC CTC TGG GCT CTG CTG TTC ATG CAG TCC TTG TGG 165 
Giu Ser Giy His Leu Leu Trp Ala Leu Leu Phe Met Gin Ser Leu Trn 
-15 -10 -5 

CC? CAA CTG ACT GAT GGA GCC ACT CGA GTC TAC TAG CTG GGC A7C CGG 213 
Pro Gin Leu Thr Asp Gly Ala Thr Arg Val Tyr Tyr Leu Gly lie Arg 
15 10 15 

GAT GTG CAG TGG AAC TAT GCT CCC AAG GGA AGA A-^T GTC ATC ACG AAC 261 
Asp Val Gin Trp Asn Tyr Ala Pro Lys Gly Arg Asn Val lie Thr Asn 
20 25 30 

CAG CCT CTG GAC AGT GAC ATA GTG GCT TCC AGC TTC TTA AAG TCT GAC 309 
Gin Pro Leu Asp Ser Asp lie Val Ala Ser Ser Phe Leu Lys Ser Asp 
35 40 45 

AAG AAC CGG ATA GGG GGA ACT ACA AGA AGA CCA TGG 345 
Lys Asn Arg lie Gly Gly Thr Thr Arg Arg Pro Tro 
50 55 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 6 base pairs 

(B) TYPE: NUCLEIC ACID 
{C) STRANDEDNES3: DOUBLE 
(D) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: CDN;. 

(vi*. ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
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iT) TISSUE TVPS: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 100.. 159 

(C) IDEKTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.3 

seq LLVKGSLPSASWS/LP 

(xi: SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



ACGGTGCAGG GCAGAGAAGG AGCAGCCTTG GACTGGGGAT CCTGAGTAGT CCTGTCTGGG 60 

AATGGAGGGC ACTGAATTGG CACCCTCCTT GGAGGCCAC ATG GCC CAA AC A TGG 114 

Met Ala Gla Thr Trp 
-20 

GCA TTD CTG CTG GIG ATG GGA TCT CTC OCT TCT GCC AGC TGG TCT CTG 162 
Ala Xaa Leu Leu Vai Met Giy Ser Leu Pro Ser Ala Ser Tro Ser Leu 
-15 -10 -5 ■ i 

CCC TGT TTG AGC TGG GAA AGT TTG CTG AAG GCT GCA GCC TGT TCT Q^C 210 
Pro Cys Leu Ser Trp Glu Ser Leu Leu Lys Ala Ala Ala Cys Ser Glu 
5 10 15 

TTG GAT GGT AGA AAT GTA GGA AAT ACA CCA AC? CGG 24 6 

Leu Asp Giy Arg Asn Val Gly Asn Thr Pro Thr Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 

(B) TYPE: NUCLEIC ACID 
(CI STRANDEDNESS : DOUBLE 
(D) TOPOLOGY: LINE.z^ 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 130.. 195 

(CJ IDENTIFICATION METHOD: Von Heijne matrix 
CD) OTHER INFORMATION: score 8.7 

scq LITLLYVWPVINA/CQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



ATGAATGAGT GTT.^ATAGGC AATTTT.aAAG GACAG.aACC? CTGGGGAACC ATCCTGCAGT 60 
TCTCCATGTG T.ACTTAAG7T GATTTTGGAA ACCAGAAACA TATACKACTT CCTTAG.QAGT 120 
TCTACATTG XrZ AAA TGT GGG TTT CTG GCT TAC TTG CTA ATC ACA CTC TTG 171 
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..Mec Lys Cys Giy ?he Leu Aia Tyr Leu Leu lie Thr Leu Leu 
-20 -L5 -10 

TAT GTT TGG CCA GTT ATT AAT GCT TGC CAG 201 
Tyr Val Trp Pro Val He Asn Aia Cys Gin 
-5 ■ 1 



(2) INTDRNL2^TI0N TOR SSQ 10 NO: SO: 

ii) SEO'JENCE CHARACTERISTICS: 

;A) LENGTH: 128 base oairs 
(S) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3 : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hono Sapiens 
(F) TISSUE TVPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 21.. 95 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.5 

seq LKVLLLPLAPAAA/QD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



AGGGCGGATC TTCTCCGGCC ATG AGG AAG CCA GCC GCT GGC TTC CTT CCC TCA 53 

Met Arg Lys Pro Aia Ala Gly Phe Leu Pro Ser 
-25 -20 -15 

CTC CTG AAG G?G CTG CTC CTG CCT CTG GCA CCT GCC GCA GCC CAG GAT 101 
Leu Leu Lys Vai Leu Leu Leu Pro Leu Ala Pro Ala Ala Ala Gin Asp 
-10 -5 1 

TCG ACf CAG GCC TCC ACT CCA GGC AGG 128 
Ser Thr Gin Aia Ser Thr Pro Gly Arg 
5 10 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(iii C-IOLECULE TYPE: CDNA 

(vn ORIGIN.iVL SOURCE: 

(A) ORGANISM: Homo Sapiens 
£-) TISSUE TYPE: Spleen 
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f-xl FEATURE: 

(A) NAM2/KEY: sia_ceotide 
(3) LOCATION: 152/. 202 

(C) IDENTiriCATIOM METHOD: Vcn Heijne rr.atrix 

(D) OTHER INFORMATION: score 3.4 

seq L-FLT3Vy?rVLA/?=^ 

(x:.) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



AAGAATCTTC CCAGTAGGCG GCGCGGGAGG GAAAAGAGGA TTGAGGGGCT AGGCCGGGCG 50 

GATCCCGTCC TCCCCCGATG TGAGCAGTTT TCCGAAACCC CGTCAGGCGA AGGCTGCCCA 120 

GAGAGGTGGA GTCGGTAGCG GGGCCGGGAA C IKTG AGO GAG TCT CTC C?A TTC 172 

Met Arg Gin Ser Leu Leu Phe 
-15 

CTG ACC AGC GTG GTT CCT TTC GTG CTG GCG CCG CGA CCT CCG GAT GAC 220 
Leu Thr Ser Val Vai Pro Phe Val Leu Ala Pro Arg Pro Pro Asp Asp 
-10 . -5 1 5 

CCG GGC TTC GGC CCC CAC CAG AGA CTC GAG AAG CTT GAT TCT TTG CTC 268 
Pro Giy Phe Giy Pro His Gin Arg Leu Giu Lvs Leu Aso Ser Leu Leu 
10 15 ■ 20 

TCA GAC TAG GAT ATT CTC TCT TTA TCT AAT ATC CAG CAG CAG CSG 313 
Ser Asp Tyr Asp lie Leu Ser Leu Ser Asn lie Gin Gin Gin Xaa 
25 30 35 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i: SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(iij MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/ECEY: sig_peptide 

(B) LOCATION: 29.. 103 

(C) IDENTIFICATION METHOD: Von Heijne macri.x 

(D) OTHER INFORMATION: score 8.1 

seq 3V1LGLLALMA7A/AV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 



AAGC7GAGGT GGCAGTGGTT CCACCAAC ATG GAG CTC TCG CAG ATG TCG GAG 52 

Met Giu Leu Ger Gin Met Ser Giu 
-25 -20 



CTC .^iTG GGG CTG TCG GTG TTG CTT GGG CTG CTG* GCC CTG .ATG 333 .ACG 



100 
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Leu Met G-.y Leu Ser Vai Leu Leu Gly Leu Leu Ala Leu Met Ala Th- 
--^ -10 -5 

GCG GCG GTA GCG CGG GGG TGG CTG CGC GCG GGG GAG GTG AGG 142 

Aia Ala V2i Aia Arg Gly Trp Leu Arg Ala Gly Glu Val Arc 
1-5 10 ' 



(2) rNFCRMATION FOR SEQ ID NO: 63: 

{i: SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: CDNA 

(vi; ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
IF) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
{B) LOCATION: 50.. 244 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8 

seq LTLIGCLVTGVES/KI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



AAGGAAAGGA TTACTCGAGC CTTGTTAGAA TCAGACATGG CTTCAGGGG ATG CAG GAG 58 

Met Gin Asp 
-65 

GCT CCC CTG AGC TGC CTG TCA CCG ACT AAG TGG AGC AGT GTT TCT TCC 106 
Ala Pro Leu Ser Cys Leu Ser Pro Thr Lys TrD Ser Ser Vai Ser Ser 
-60 -55 * -50 

GCA GAC TCA ACT GAG AAG TCA GCC TCT GCG GCA GGC ACC AGG A-AT CTG 154 
Aia As? Ser Thr Giu Lys Ser Aia Ser Aia Aia Gly Thr Arg Aiiii Leu 
-45 -40 -35 

CCT TTT CA3 TTC TGT CTC CGG CAG GCT TTG AGG ATG AAG GCT GCG GGC 202 
Pro Phe Gin Phe Cys Leu Arg Gin Ala Leu Arg Met Lys Aia Aia Gly 
-30 -25 -20 -15 

ATT CTG ACC CTC ATT GGC TGC CTG GTC ACA GGC GTC GAG TCC ATC 250 

lie Leu Thr Leu lie Gly Cys Leu Vai Thr Giy Val Glu Ser Lvs lie 
-10 -5 ' 1 

TAC ACT CGT TGC AAA CTG GCA AAA ATA TTC TCG AGG GCT GGC CTG GAC 298 
Tyr Thr Arg Cys Lys Leu Ala Lys lie Phe Ser Arg Aia Civ Leu Aso 
5 10 15 ' ■ 

AAT CYG ~ZZ GGC TTC AGC CTT GGA AAS TGG ATC TGC ATG GCC TAT TAT 34 6 
As:: Xai Ar:; Giy Phe Ser Leu Giy Xaa Trp lie Cys Met Aia Tvr Tyr 
20 25 30 
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GAG AGC G3C TOG 353 
Giu Ser Ziy Trp 
35 



(2) INFORMATION FOR SZQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDMESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_pe?tide 

(B) LOCATION: 24.. 311 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.6 

seq ALCGLCLLCPRAA/RV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



ATTTCTCCTG GCACCCTGTA TTC ATG GCC TTG GCG TTC TGC CTC TGC ATG GCT 53 

Met Ala Leu Ala Phe Cys Leu Cys Met Ala 
-95 -90 

GAA GCC ATC CTA CTC TTC TCA CCT GAA CAC TCC CTG TTC TTC TTC TGC 101 

Glu Ala tie Leu Leu Phe Ser Pro Glu His Ser Leu Phe Phe Phe Cys 
-85 -30 -75 

TCC CGA AAA GCA CGG ATC CGG CTC CAC TGG GCA GGG CAG ACC CTA GCC 14 9 

Ser Arg Lys Ala Arg lie Arg Leu His Trp Ala Gly Gin Thr Leu Ala 
-70 ^ -65 -50 -55 

ATC CTC TGT GCA GCT CTG GGC CTG GGC TTC ATC ATC TCC AGC AGG ACC 197 

lie Leu Cys Ala Ala Leu Gly Leu Gly Phe lie He Ser Ser Arg Thr 

-50 -45 -40 

CGC AGT GAG CTG CCT CAT CTG GIG TCC TGG CAC AGC TGG GTG GGA GCC 24 5 

Arg Ser Glu Leu Pro His Leu Val Ser Trp His Ser Trp Val Gly Ala 
-35 -30 -25 

CTG ACA CTG CTG GCC ACT GCT GTC CAG GCA CTG TGT GGG CTC TGC CTC 2 93 

Leu Thr Leu Leu Ala Thr Ala Val Gin Ala Leu Cvs Gly Leu Cys Leu 
-20 -15 ' -10 

CTT TCT CCC CCC CCA CCC AGG GTC TCA AGG GTG GCT CGC CTC AAG CTC 341 

Leu Cys Pro Arg Ala Ala Arg Val Ser Arg Val Ala Arg Leu Lys Leu 

-5 1 ' 5 10 

TAC CAT CTG ACA TGT GGA CTG GTG GTC TAC CTG ATG GCT ACA GTA ACG 35 9 

Tyr His Leu Thr Cys Gly Leu Val Val Tyr Leu Met Ala Thr Val Thr 

15 20 25 
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GTG CTT C7G GGC ATG TAG TCA GTA TGG TTC 
Vai Leu Leu Gly Met Tyr Ser Val Tro ?he 
30 3*5 



{?.) ZNTORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHAR-QiCTERISTICS : 

(A) LENGTH: 336 base pairs 

(B) TYPE: NUCLEIC ACI3 

(C) STRANOEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

{ill MOLECULE TYPE: CONA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 37.7207 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(DJ OTHER INFORMATION: score 7.5 

seq LLHRLASFHRVWS/FP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



AAGKCTGCCG GTGGGGACTC TTGCAGGGCC GTCCCC ATG TTR CGT TTT CCG ACC 54 

Met Leu Arg Phe Pro Thr 
-55 

TG7 TTC CCA TCC KTC CGG GTG RTG GGA GAK AAG CAG CTC CCG CAG GAG 102 

Cys Phe Pro Ser.Xaa Arg Val Xaa Gly Xaa Lys Gin Leu Pro Gin Glu 
-50 -45 -40 

ATT ATT TWC CTG GTC TGG TCG CCC AP.K CGG GAT CKC ATT GST TTG GCC 150 
lis He Xaa Leu Val Trp Ser Pro Xaa Arg Aso Xaa He Xaa Leu Ala 
-35 * -30 -2*5 -20 

AA.C ACA GCT GGC GAG GTT TTA CTT CAT CCA CTG GCA AGT TTT CAT CGA 193 
Asn Thr A.la Gly Glu Val Leu Leu His Arg Leu Ala Ser Phe His Arg 
-15 -10 -5 

GTT TGG AGT TTT CCA CCA AAT GAA AAT ACA GGA AWK GAG GTG ACG TGT 24c 
Vai Trp Ser Phe Pro Pro Asn Glu Asn Thr GIv Xaa Glu Val Thr Cys 
1 5 " 10 

CTG GCA TGG AGA CCA GAT GGC AAA CTT TTG GCC TTT GCT CTT GCT GAT 294 
Leu Ala Trp Arg Pro Asp Gly Lys Leu Leu Ala Phe Ala Leu Ala Asp 
15 20 25 

ACC AAG .^-AA ATT GTT TTG TGT GAT GTA GAA AAA CCT GAG AGC 33c 
Tr.r Lys ^ys He Val Leu Cys Asp Val Glu Lvs Pro Giu Ser 
2': 35 40 
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(i) SEQUSN'CE CHARACTERISTICS: 

(A) LENGTH: 398 base pairs 
(S) TYPE: taUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPCLOGY: LINEAR 

MOLECULE TYPE: CDNA 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig^peotide 

(B) LOCATION: 9.. 134 

(CJ IDENTIFICATION METHOD: Von Heijne mamx 
(D) OTHER INFORMATION: score 7.5 

seq LALWALVAERFA/RR 

(xi) SEQUsNCt: CEbCRIPTION: SEQ ID NO: 66: 



AGACCTTC ATG TTC ATG GTG CTG GAG GTG GTG GTG AGC CGG GTG ACC TCG 50 
Met Phe Met Val Leu Giu Vai Vai Val Ser Arg Val Thr Ser 
-40 -35 -30 

TCG CTG GCG ATG CTC TCC GAC TCC TTC CAC ATG CTG TCG GAC GTG CTG 98 
Ser Leu Ala Met Leu Ser Asp Ser Phe His Met Leu Ser Asp Val Leu 
-25 -20 -15 

GCG CTG GTG GTG GCG CTG GTG GCC GAG CGC TTC GCC CGG CGG ACC CAC 14 6 
Ala Leu Val Vai Aia Leu Val Ala Giu Ary Phe Ala Arg Arg Thr His 
-10 -5 1 

GCC ACC CAG AAG AAC' ACG TTC GGC TGG ATC CGA GCC GAG GTA ATG GGG 194 
Ala Thr Gla Lys Asn Thr Phe Gly Trp lie Arg Ala Giu Val Met Giy 
5 10 15 20 

GCT CTG GTG AAC GCC ATC TTC CTG ACT GGC CTC TGT TTC GCG ATC CTG 24 2 
Ala Leu Val Asn Ala lie Phe Leu Thr Gly Leu Cys Phe Ala lie Leu 
25 30 35 

CTG GAG GCC ATC GAG CGC TTC ATC GAG CCG CAC GAG ATG CAG CAG CCG 290 
Leu Giu Aia lie Giu Arg Phe lio Giu Pro His Giu Met Gin Gin Pro 
40 45 50 

CTG GTG GTS CNT TGG GGT CGG GCG TGG CSG GCT GCT GGT C?Ji CGT GCT 338 
Leu Vai Val Xaa Trp Giy Arg Aia Trp Xaa Aia Ala Giy Gin Arg Aia 
55 60 65 

GGG GCT CTG CCT CTT CCA CCA TCA CAG CGG CTT CAG CCA GGA CTC CGG 33 5 
Giy Aia Leu Pro Leu Pro Pro Ser Gin Arg Leu Gin Pro Giy Leu Arg 
70 75 80 

CCG CGG CCA TGG 393 

Prry Arg Pro Trp 
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(2) INFORMATION FOR SZQ ID MO: 67: 

it) SEOUEMCE CHARACTERISTICS: 

(A} LENGTH: 306 base pairs 
(3; TYPE: MUCLEIC ACID 
{C; STRAiNDEDNESS: DOUBLE 
(D5 TOPOLOGY: LINEAR 

(li) MOLECULE TYPE: CDNA 

(vi) ORIGIMAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY; sig^pepticte 

(B) LOCATION: 70.. 18 6 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: scare 7,5 

seq LLLILGLIVLVNI/GI 

(xi) SEQUENCE DESCRIPTION: 3EQ ID NO: 67: 



AASTGTGSST TGGGGCCGGG GGTGGGGGGC AGAGGGGGGT GGCCCAGGTG GCCCTAGGAC 60 

CCCCCCTCC ATG GAA AAC GAG CTA TGG CAT AAC ACC GTG AGA TGT TGC AAT 111 
Met Giu Asn Gin Leu Trp His Asn Thr Vai Arg Cys Cys Asn 
-35 -30 

CAA TAC CAA GAJ^ ACC CCC CAC GAT GCC GAG GAC ATC TTA CTC CTG CTG 159 
Gin Tyr Gin Glu Ser Pro His Asp Ala Glu Asp lie Leu Leu Leu Leu 
-25 -20 -15 -10 

CTG GGC CTC ATC GTT CTT GTC AAC ATT GGC ATC AAC GTG GCA ACT ATG 207 
Leu Gly Leu He Val Leu Val Asn He Gly He Asn Vai Ala Thr Met 
-5 1 5 

ATG TGG CAT GGA CTC CAG AAC GCC TTA GAC AAG ATG ATT GAT TGG GCT 255 
Mec Trp His Gly Leu Gin Asn Ala Leu As? Lys Met lie Asp Trp Ala 
10 15 20 

ACT CAG AAA ATA GCA GTC TTC TTC GCT GTG TTC GTC GCC GCC GCC GCC 303 
Thr Gin Lys He Ala Val Phe Phe Ala Vai ?he Vai Ala Ala Ala Ala 
25 30 35 

CGG 305 
Arg 
40 



(2> INFORMATION FOR SEQ ID NO: 68: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) STRANDEDNESS: DOUBLE 
iO) TOPOLOGY: LINEAR 
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MOLEC'JLi 7Y?E: CDMA 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM; Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix; FEATURE: 

(A) NAuME/KEY: sig_peptide 
(3) LOCATION; 23.. 7 6 

(C) IDE:JTIFICATI0N method; Vcn Heijne r.acri.^c 

(D) OTHER INFORMATION: score 7.5 

seq ITLLTLSPNSVCC/C? 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



AATACTGAGG TATATTGCCA AA ATG CTC TCC AKW AAG ATC ACC CTC TTG ACA 52 

Met Leu Ser Xaa Lys lie Thr Leu Leu Thr 
-15 -10 

CTG TCA CCC AAT AGT GTG TGT TGC TGC CCC TCA GCA ACC CTC GGT GCC 100 
Leu Ser Pro Asn Ser Val Cys Cys Cys Pro Ser Ala Thr Leu Gly Ala 
-5 1 5 

AGC AAT CAT TCT CAT CTT TGG AGA TCT ACT AGC AGA CAT GGC ATC TCC 148 
Ser Asn Kis Ser His Leu Trp Arg Ser Thr Ser Arg His Gly lie Ser 
10 15 20 

TTC CCA TGG GCA TTC CTT TTA ATT AAC GGG 178 
Phe Pro Trp Ala Phe Leu Leu lie Asn Giy 
25 30 



(2) INFORM.ATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRAMDEDNESS: DOUBLE 
<D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 79.. 132 

(C) IDEMTIFICATION METHOD: Von Heijne matri.v. 

(D) OTHER INFORMATION: score 7.4 

seq GWLVLCVLAISLA/SM 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



CTGCTGGGCA GCCCAC.AGGG TCCCTGGGCG G.AGGGCAGGA GC.ATCCMGTT GG.AGTTGACA cC 
ACAGGAGGCA GAGGCATC .ATG GAG GGT CCC C3G GGA TGG CTG CTC CTC TCT 111 
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M^t Giu Giy Pro Arg Giy Trp Leu Vai Leu Cvs 
-15 -10 

GTG CTG GCC ATA TCG CTG GCC TCT ATG GTG ACC GAG GAG 7TG TGC CGA 159 
Vai Leu Ala lie Ser Leu Aia Ser Met Vai Tar Giu Asp Leu Cvs Arg 
-5 * 1 5 " 

GCA CCA GAC GGG AAG AJ3J\ GGG GAG GCA GGV AVA CCT GGC AGA CGG GGG 23-7 
Aia Pro Asp Giy Lys Lys Giy Giu Ala Giy Xaa Pro Giy Arq Arg Giy 
IC 15 20 * 25 

CGG CCA GGC CTC AAG GGG GAG CAA CGG 234 
Arg Pro Giy Leu Lys Giy Giu Gin Arg 
30 



(2) INFORMATION FOR SEQ ID NO: 70; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 364 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(c) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 41.. 100 

(C) IDENTIFICATION METHOD: Von Heijn.e matrix 

(D) OTHER INFORMATION: score 7.3 

seq LAVFMLLAQLVSG/NW 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7C: 



AATAGAGACT TCTGGACTCT ATAGAACCCA CTGCCTCCTG ATG AAG TCC CTA CTG 55 

Met Lys Ser Leu Leu 

-20 

TTC ACC CTT GCA GTT TTT ATG CTC CTG GCC CAA TTG GTC TCA GGT AAT 103 
Phe Thr Leu Ala Vai Phe Met Leu Leu Aid Gin Leu Vai Ser Giy Asn 
-15 -10 -5 1 

TGG TAT GTG AAA AAG TGT CTA AAC GAC GTT GGA ATT TGC AAG AAG AAG 151 
Trp Tyr Vai Lys Lys Cys Leu Asn Asp Vai Giy lie Cys Lys Lvs Lys 
5 lb 15 

TGC AJ\A CCT GAA GAG ATG CAT GTA AAG AAT GGT TGG GCA ATG TGC GGC 199 
Cys Lys Pro Giu Giu Met His Vai Lys Asn Giy Tro Aia Me*: Cvs Giv 
20 , 25 * * 30 ' * 

AAA CAA AGG GAC TGC TGT GTT CCA GCT GAC ACA CGT GCT AAT TAT CCT 24 7 
Lys Gin Arg Asp Cys Cys Vai Pro Aia Asp Arg Arg Ala Asn Tyr Pro 
35 40 45 
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GTT TTC TGT GTC CAG ACA AAG ACT ACA AGA ATT TCA ACA CTC ACA GCA 

VaL rhe Cys Vai Gin Th.r Lys Thr Trir Arg lie Ser Thr Va: Thr Ala 
3J 55 60 65 

ACA ACA GCA ACA ftCA ACT TTG ATG ATG ACT ACT GCT TCG ATG TCT TCG 

T'nr T.^.r Ala Thr Thr Thr Leu Met Met Thr Thr Aia Ser Ket Ser Ser 

70 75 30 . 



ATG GCT CCT ACC CGT TTC TCC 
Met Ala Pro Thr Arg Phe Ser 
35 



364 



(2; IMrORMATION TOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) STRANDEDNESS: DOUBLE 
(01 TOPOLOGY: LINEAR 

•ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(E) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 9.. 56 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.3 

seq LILLFSLLISIVC/MI 

(>ci) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 



ATAGTA.2JV ATG TTA AAG TTG ATC TTA CTT TTT TCG CTC CTC ATC TCT ATT 
Met Leu Lys Leu lie Leu Lea P:ie Ser Leu Leu lie Ser lie 
-15 -10 -5 



50 



GTT TGT ATG ATT 
Vai Cys Met lie 
1 



62 



(2- INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 296 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii: MOLECULE TYPE: CDNA 

(vi: ORIGINAL SOURCE: 

unj ORGANISM: Homo Sapiens 
(") TISSUE TYPE: Testis 
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(A) NAM£/K£V: siq^Deptice 
;B) :.0CATICN: 195. '.272 

:C) IDENTIFICATION METHOD: Von Heijne matrix 
■0) OTHER INFORMATION: score 7.1 

seq LASLQWSLTLAWC/GS 

(xi; SEQUENCE DESCRIPTIOM: SEQ ID NO: 72: 



AAAGTGTAGA ACACGGACCT CTGAGTTATG CTCTTGAGAG GTGCCAAAGC TGGGCTGTT7 60 

ACCTACCTTA TCCACAGAGC TCTGAAAGTC AAGCCAGAAA GGAAGGATTC CAAATTCTTG 120 

GAATTTTATC TAGAAAAGAA GACTAAGCAG CTTTTGTTCT TCTGTGACCC AGTTGCTGGC 18C 

CCAAGACATG GACA ATG ACC CCC TGG TGT TTG GCG TGT CTG GGG AGG AGG 230 
Met Thr Pro Trp Cys Leu Ala Cvs Leu Gly Arg Arg 
-25 -20 -15 

CCT CTC GCT TCT TTG GAG TGG AGC CTG ACA CTG GCG TGG TGT GGC TCC 273 
Pro Leu Ala Ser Leu Gin Trp Ser Leu Thr Leu Ala Trp Cys Gly Ser 
-10 -5 1 

GGC AGC CAC TGG ACA GAG * 296 

Gly Ser His Trp Thr Glu 
5 



(2} INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: D0U3LE 
CD) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 15lT.228 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.9 

seq LWVLLLCAHWTL/LV 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



AACACGCAGC TAGACACAGC TAMCTTGAGT CTTGGAGCTC CTAGAGGGAJ^ GCTTCTGGAM 60 

AGGAAGGCTC TTCAGGACCT CTTAGGAGCC AGAGMMSMGG ACGTXSACAC AGA7AAAGAG 120 

CCAGGCTCAC CAGCTCCTGA CGCATGCAK3 ATG ACC ATG A3A CAC .VwC TGG ACA il4 

Me- Thr Met Arg His Asn Trp Thr 
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-25 



-20 



CCA GAG CTC AGC CCT TTG TGG GTC CTG CTC CTG TGT GCC CAC GIC GTC 
Fro Asp Leu Ser Pro Leu Trp Val Leu Leu Leu Cvs Aia His Vai Va* 
-15 ■ -iO " -5 



222 



ACT CTC CTG GTC AGA GCC ACA CCT GTC TCG CAG ACC ACC ACA GCT GCC 
Thr Leu Leu Vai Arg Aia Thr Pre Val Ser Gin Thr Thr Thr Aia Aia 
: ■ 5 10 



270 



ACT GCC TCA GTT AGA AGC ACA AAG GAC CCC TGC CCC TCC CAG CGG 
Thr Aia Ser Vai Arc Ser Thr Lys Asp Pro Cys Pro Ser Gin Arg 
15 20 ' 25 



315 



(2) INFORt-lATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 27.786 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.9 

seq LFCATL3CMPATS/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



AAGCCTACTT TGACACTCAT TTAAAG ATG ACA GGG AAC AAT AGA GAT TTG TTC 53 

Met Thr Giy Asn Asn Arg Asp Leu Phe 
-20 -15 

TGT GCA ACC CTT TCT TGT ATG CCG GCG ACA TCA GCT CCG CAC ATG AAA 101 
Cys Aia Thr Leu Ser Cys Met Pro Aid Thr Ser Aia Pro His Met Lvs 
-10 -5 1 *5 

CTG CCC GAT ATT TCA TTC CAC CTG CCC GGG 131 
Leu Pro Asp lie Ser Phe His Leu Pro Giy 
iO 15 



(2) INF0R£-L\7I0N FOR SEQ ID NO: 75: 

(i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 
;B) TYPE: NUCLEIC ACID 
(C) STRANDEDNESS: DOUBLE 
:D) TOPOLOGY: LINEAR 



wo 99/06549 PCT/IB98/0I231 



(ii) MOLECULi TY?E: CDNa 

(vi) ORIGINAL SOURCE: 

ORGANISM: Home Sapiens 
(D TISSUE TYPE: Ovary 

(im: FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 1147.191 

(C) IDENTIFICATION METHOD: Von Heijna matrix 

(D) OTHER INrORRATION: score 6,9 

seq LWVLLLCAHVVTL/LV 

{:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



ACTTCCCAGA AGCA3CTCTG GTGCTGAAGA GAGCACTGCC TCCCTGTGTG ACTGGAGAAG 60 

AGGACGTTGT CACAGATAAA GAGCCAGGCT CACCAGCTCC TGACGCATGC ATC ATG 115 

Met 

ACC ATG P.C-A CAC AAC TGG ACA CCA GAC CTC AGC CCT TTG TGG GTC CTG 164 
Thr Met Arg rlis Asn Trp Thr Pro Asp Leu Ser Pro Leu Tro Vai Leu 
-25 -20 * -15 * -10 

CTC CTG TGT GCC CAC GTC GTC ACT CTC CTG GTC AGA GCC ACA CCT GTC 212 
Leu Leu Cys Ala His Val Vai Thr Leu Leu Val Arg Ala Thr Pro Vai 
-5 1 5 

TCG CAG CCC ACG 224 
Ser Gin Pro Thr 
10 



(2i INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
13) LOCATION: 79.7"l38 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
OTHER INFORMATION: score 6.9 

seq LYLLGMLVPGGLG/YD 

SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



ACTTGATGCA TTCATTCTCA AGGACACTTG ATCCACTCCC AGACAGCCCC AGiJ-TTTTCT 



wo 99/06549 



58 



PCr/IB9&'OI23I 



AACTTACTGT GT3GCAGA AT3 AAG CCT CTG CTT GAA ACC CTT TAT CTT TTG 111 
Met Lys Pro Leu Leu Glu Thr Leu Tvr Leu Leu 
-20 -15 -iO 

GGG ATG CTG G77 CCf GGA GGG CTG GGA TAT GAT AGA TCC TTA GCC CA.A 159 
Gly Hez Lcj Val Pro Giy Giy Leu Gly Tyr Aso Arq; Ser Leu Ala Gin 
-5 1 ■ • 5 

CAC AGA CX\ GAG ATT GTG GAC AAG TCA GTG ACT CCA TGG AGC CTG GAG 207 
His Arg Gin Giu lie Vai Asp Lys Ser Val Ser Pro Tro Ser Leu Glu 
IC * 15 20 

ACG TAT TCC TAT AAC ATA TAG CAC CCC ATG GGA GAG ATC TAT GAG TGG 255 
Thr Tyr Ser Tyr Asn lie Tyr His Pro Met Gly Giu lie Tyr Glu Trp 
25 30 35 

ATG AGA GAG ATC AGT GAG AAG TAC AAG GAA GTG GTG ACA CAG CAT TTC 303 
Met Arg Giu lie Ser Glu Lys Tyr Lys Glu Val Val Thr Gin His Phe 
40 45 50 55 

CTA GGA GTG ACC TAT GAG ACC CAA CCC GCG 333 
Leu Giy V2I Thr Tyr Glu Thr Gin Pro Ala 
60 55 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 
iO) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: CDNA 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_pepcide 

(B) LOCATION: 80,. 274 

(C) IDENTIFICATION METHOD: Von Hei^ne matrix 
{0) OTHER INFORMATION: bcjore 6.3 

seq LLFLISLAAHLSQ/WT 

{xi) SEQUENCE DESCRIPTION: SEQ ID SO: 77: 



AAAGTATTGG GGATGCTGAG CTGCGGGG7A CGGGCCTGAG GAGGGATGGG AGTAAGAAG? 60 

GCTGTGGA.AA CCGTCAGCC ATG AAC CAG GCT GAC CCT CGG CTC AGA GCA GTG 112 

Met Asn Gin Ala Aso Pro Arg Leu Arg Ala Val 
-65 * -50 -55 

TGC TTG TGG ACT CTC ACA TCT GCA GCC ATG AGC AGA GGC GAC .---.C TGC 160 
Cys leu Trp Tr.r Leu Thr Ser Ala Ala Me- Ser Ar; Gly Asp Asn Cys 
-50 -45 -O 
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ACG GAT CTA CTC 3CA CTG GGA ATC CCC TCC ATA ACC CAG GCC TGG GGA 
Tp.r Asp Lau L^u Aia Leu Giy He Pre Ser He Tar Gin Ala Tro Cly 
-35 -30 -25 

CTG TGG GTC CTC TTA GGG GCT GTG ACG CTG CTA 7TT CTC ATC TCG CTG 
i-eu Trp Val Leu Leu Giy Aia Val Thr Leu Leu Phe Leu lie Ser Leu 
-20 -15 -10 

GCT GCA CAC TTG TCC CAG TGG ACC AGG GGT CGG AGC GGG 
Aia Aia His Lea Ser Gin Trp Thr Arg Giv Arg Se*- Giy 
-5 1*5 



:2) rMFORMATICN FOR SZQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : 0OU3LE 

(D) TOPOLOGY: LINEAR 

(ii) t-:OLECULE TYPE: CDNA 

(VL) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 317.. 442 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.8 

seq LLSILSSLTMVIC/RH 

i>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 



ACT AC AC AG A 


GAGAAGCC^T 


CATTCTAGCT AGAC.=l.^\GA-AG 


CTCGGGAAGA ATTACTTTTA 


60 


CATCAGAGTG 


i-ATGGGAGGG 


AAGAATATCT CCCGAGCAGG 


TTGACACCTC TTCCTTACCC 


120 


CTAGT.ACCAC 


AGCATTCATT 


CGCCTCATTA CC7CTTAATG 


AATCTGAAAG AAACCAAGAA 


180 


CCATGTTCAA 


TTAACAGTGA 


TAATATAGTA TCCTCAGGTC 


ACTCAGAGAT ACCAACATTG 


240 


CCTGATGGGC 


TGTTGGGTTT 


ATCACATCTT GTTTTACCTC_ 


.=iACAAGATAA TTTGATTGCA 


300 


CTTGA.^G/u\C 


ACTTGC ATC 
Met 


CAC AGA CAG ATT TCC TTC CTT CTA TTG AGA AAA 
His Arg Gin lie Ser Phe Leu Leu Leu Arg Lys 
-40 -35 


352 


CCC AGA A.AG 
Pro Arg Lys 
-30 


AAT TGG TTT TGT CAA AAC CAT GTA 
Asn Trp Phe Cvs Gin Asn His Val 
-25 -20 


AAT TTG AGG AAA AGG 
Asn Leu Arg Lys Arg 

-15 


400 


T.-.T CTT CTG 


AGC ATT TTA TCC AGT CTC ACC ATG 
Ser lie Leu Ser Ser Leu Thr Met 
-10 -5 


GTG .ATT TGC AGA CAC. 
Val lie Cys Arg His 
1 


443 
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(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHAFACTERISTICS : 

(A) LENGTH: 317 base pairs 
'3} TYPE: NUCLEIC ACID 
■:C) STRANDEDNES3 : DOUBLE 
iZ) TOPOLOGV: LINEAR 

(ii; MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISD^: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 162.. 290 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.8 

seq ALSAXTFVSFLHA/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



AGTACGGATC TCTTTAATAT TCTGTGTAAC AAAATAGAAA TGCTCATAAA GTAC7TCTGC 60 

GGCAAACCAA AGTATAGCAC CTGACTCAAG GAAAAGCAAG GAAAAGCACA TGTGGGATCC 120 

CTTGAATGGC Ai^GTGAAACT AGCCACTAGT TTCATTTTTA C ATG AAA CAA TGG CTG 176 

Met Lys Gin Trp Leu 
-40 

TGT TGG GTG CTG AGG CTG GAA GGT AGA CAG GGG CTT GGG GTT GGA GAG 224 
Cys Trp Vai Leu Arg Leu Giu Giy Arg Gin Giy Leu Gly Vai Gly Giu 
-35 -30 -25 

OCT CGT GGG CTG GGT TTG TGC TTG GGG GCC TTG AGC GCA SCC ACC TTT 272 
Pro Arg Gly Leu Arg Leu Cys Leu Gly Ala Leu Ser Ala Xaa Thr Phe 
-20 -15 -10 

GTC AGC TTT CTA CAC GCT GCT CCC CAC TCC CAT CCA GCC CTT GGG 317 
Vai Ser Phe Leu His Ala Aia Pro His Ser Hls Pro Ala Leu Giy 
-5 1 5 



(2) liJEORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

{Al LENGTH: 235 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3 : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) :r:ginal source: 
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•A) ORGAMISM: Homo Saoieas 
(D TISSUE TYF£: Ovary 

(ix; FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 29.7226 

(C) I DENT in CAT I ON METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 6.8 

ssq LLF?r.FPTLrrR5;/QH 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



kCPATTTCGZ TGTGTCTGGT GTYTTGCC ATG AGA CTG GGG TTA TGC TTC TGG 52 

Met Arg Leu Giy Leu Cys Phe Trp 
-65 -60 

GTG CCA CAC AGA GGT GAA ATG TCC TTC TCA TCA CAT TAT TCG AGA GGT 100 
Val Pro His Arg Giy Glu Met Ser Phe Ser Ser His Tyr Ser Arg Gly 
-55 -50 -45 

ACA TGG TA3 CAA TGG GAC TTA TCG CTG CTG ATG TTA ACC TTG ATC TCT 148 
Thr Trp Tyr Gin Trp Asp Leu Ser Leu Leu Met Leu Thr Leu lis Ser 
-^0 -35 -30 

TGG TTC AGG TGG TGC CTG CCA GCT GTC TCC ACT GTG GAG TTA CTA TTT 196 
Trp Phe Arg Trp Cys Leu Pro Ala Val Ser Thr Val Glu Leu Leu Phe 
-25 -20 -15 

TTC CTT TTC CCC ATT TTA TTC ATC AGA AGO CAG CAC CGG 235 
Phe Leu Phe Pro lie Leu Phe lie Arg Ser Gin His Arg 
-10 -5 I 



(2) IMFORJiATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(li) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

iix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 67.7369 

(C] IDENTIFICATION METHOD: Von Heijne r.atrix 

(D) OTHER INFORMATION: score 6.6 

seq IIXVITITSAC5A/CI 

!xi' SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
ACTT-.TTA.-.: AGTACTTGTT TAGATATTTT GTGTCCCTAG .^'TTTCTTTGA TG.-.TTAGGCT €0 
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GCAoTT AT" .G.n? T7T TGG GfiJK G.^A TAG CGC AGA GGT GAT G?G CCC TIC 10* 
Kec Asp Phe Trp Giu Glu Tyr Arc Ara Giy Asp Val Pro Fh^ 
-iOC -95 

TCA TGG TGT CCT ATC AGG AGC TAC C7G ATG TCA GTA T3T CCT GTT ACT 155 
Ser Tip Cys Pre lie" Arg Ser Tyr Leu Met Ser Val Cys Pro Val Thr 
-55 -30 -75 

G3C AAA GTT AAC CTT AA.T CAT TTG GTT AAG GTA GCC TCT GCC AGG TTT 20^ 
Gly Lys Val Asn Leu Asn His Leu Vai Lys Vai Ala Ser Ala Arg Phe 
-70 -65 -60 

CTC CAC C;^^ GTT ACT ATT TTT CCT TTT CTG TAC TCT GTT AAG GCA AAT 252 
Leu His Gin Val Thr He Phe Pro Phe Leu Tyr Ser Val Lys Ala Asn 
-55 -50 -45 -40 

TAT TGC TTT TTA AAT TTT GAT GTA CCT CAA TAT GCA TGG GAG ATA CAT 300 
Tyr Cys Phe Leu Asn Phe Asp Vai Pro Gin Tyr Ala Trp Glu He His 
-35 -30 -25 

AGC TTT GCA GCT CCC TCA ATC TTA ATT GTA ATA ATA ATA GTA ATA ACA 34 3 
Ser Phe Ala Ala Pro Ser lie Leu He Val lie lie lie Val He Thr 
-20 -15 -10 

ATA ACT AGC GCT TGC TCC GCC TGC ATA GTT CTA AAC ACA TGT 390 
He Thr Ser Ala Cys Ser Ala Cvs He Val Leu Asn Thr Cys 
-5 'l 5 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 9 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 
[0) TOPOLOGY; LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 59.. 139 

(C) IDENTIFICATION METHOD: Von Keijne matrix 

(D) OTHER INFORMATION: score 6.5 

seq SLSLSTVWNWIQA/Sr 

(xi; SEQUENCE DESCRIPTION: SEQ ID MO; 82: 



AATATTTAGA TTCCTGAAGC TTCTGCAC.AT GTAGTTCCTA G.AGCTGCTGC TTATTAAA 

ATG TCA ACA TCT TCA TCT TCT AGC TGG GAC AAC CTC TTA GAG TCT CTC 
Kzz Ser Tr.r Ser Ser Ser Ser Ser Trp .Aso .Asn Leu Leu Glu Ser Leu 
-25 -20 -15 

TCT CTC AGC .ACA GTA TGG A-AT TGG ATA C*V. CCA ACT TTT TTG GGA GAG 
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Ser Leu Ser Thr Vai Trp Asn Tro 11* Gin Aia Ser Phe Leu Glv Glu 
-10 -5 1 ' 5 

ACT AGT GCA CCT CAG CAA ACA ACT TTG GGA CTA T7A GAT AAT CTT GCT 202 
?hr Sar Aia Pro Gin Gin Thr Ser Leu Gly Leu Leu Asp Asn Leu Aia 
10- 15 20 

CCA GCT GTG CAA ATC ATC TTG AGG ATT TCT TTC TTG ATT TTA TTG GGA 250 
Pro Aia Vai Gin lie lie Leu Arc lie Ser rne Leu lie Leu Leu Gly 
25 30 35 

ATA GGA ATA TAT GCC TTA TGG AAA CGA AGT ATT CAG TCA ACT CAG AAA 2 93 
lie Gly lie Tyr Ala Leu Trp Lys Arc Ser lie Gin Ser lie Gin Lys 
40 45 50 

ACA TTG TTG TTT GTA ATC ACA CTC TAC AAA CTT TAG AAG AAG GGC TCG 34 6 
Thr Leu Leu Phe Vai lie Thr Leu Tyr Lys Leu Tyr Lys Lys Gly Ser 
55 60 ' 65 

GCG 349 
Aia 

70 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 302 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

{ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 27.. 104 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score €.5 

sec LA1G3AGLLWCLA/GF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



AGCAGACCGG CCGCCGCTTC ACCGGC ATG GTC TTC GCC ACC ATC GGT TTC TCG 53 

Met Vai Phe Aia Thr lie Glv Phe Ser 
-25 -20 

CTG AAG TCG GGC CTG GCC CTT GGC TCG GCG GGC CTG CTG TGG TGC CTG iOi 
Leu Lys Ser Gly Leu Ala Leu Ciy Ser Aia Gly leu Leu Trp Cys Leu 
-15 -10 -5 

GCC GGT TTC TTC GGC TAC GAC ACA CAC CAG CCC ACG GCA CCC AAC GCC 1-;.; 
Ai2 Gly Phe Phe Gly Tyr Asp Thr Gir. Gin Pro Thr Aia Pro Asn Ala 
1 5 iO 15 
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ATC GAG GGC 7AC CGC GTC ATG TCC AGC T?C GGC GTC GGC GCG T.7G 7TC 19" 
lie Glu Gly Tyr Arg Vai Met Ser Ser Phe Giy Val Giy Ai.^ Leu Phe 
20 25 30 

GCC GCC TGC ACG ATC TGC CTG CTG GCS RAC AAG CTG AAC AAG CAG ACG 24 5 
Aic Ala Cys Thr He Cys Leu Leu Ala Xaa Lys Leu Asn Lys Gin Thr 

35 . 40 45 

ACG CTG A.AG ATG GCC GAC GAC C?C GCC CAA CGG CGC CAG . CAG GCC GAC 293 
Thr Leu Lys Met Ala Asp Asp Leu Ala Gin Arg Arg Gin Gin Ala Asp 
50 55 60 

CTT GCG CCG 302 
Leu Aia Pro 
65 



(21 INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCH: CHARACTERISTICS: 

(A) LENGTH: 151 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 35.. 76 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.4 

seq VLLLSGSVSVGVC/CA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



ACAATCTCAC AGTCCGTGGC AGAGCCTTGT CCTG ATG GTT TTA TTG CTT TCT GGC 55 

Met Val Leu Leu Leu Ser Giy 
-10 

AGT GTG AGT GTG GGT GTG TGT TGT GCC TAG TTG TGC ATC TCC ATT TCT 103 
Ser Val Ser Val Gly Val Cys Cys Ala Tyr Leu Cys He Ser lie Ser 
-5 1 5 

AAA ACA CCA ACT GCT TGT GCA TTG TAT GGT CTT TAT TTA CCG TTT TTT 151 
Lys Thr Pro Thr Aia Cys Aia Leu Tyr Gly Leu Tyr Leu Pro Phe Ph« 
10 15 20 25 



(2; INFORMATION FOR SEQ ID NO: 85: 



SEQUENCE CHARACTERISTICS : 
iA) LENGTH: 169 base pairs 
(3) TYPE: NUCLEIC ACID 
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■*{C) STRANDEDMESS : DOUBLE 
iD) TOPOLOGY: LINEAR 

■ii) MOLECULE TYPE: CDMA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{=) TISSUE TYPE: Uterus 

(i:<) rEAT'JRE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 26.. 112 

(C) IDENTiriCATION MXTHOD: Von Heijne matri.x 

(D) OTHER INFORMATION: score 6.4 ' 

seq GLCXLCXVXNVFA/GS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



ATAATCTGTA ACTTTAGCCC CAACC ATG TGC TCG CAG AAA CGT GCT GTA TCA 52 

Met Cvs Ser Gin Lys Arg Ala Val Ser 
-25 

AAT CAA GGT TTA ATG GAT TTA GGG CTG TGC ARG CTG TGC YTT GTT AMC 100 

Asn Gin Giy Leu Met Asp Leu Gly Leu Cys Xaa Leu Cys Xaa Val Xaa 
-20 -15 -10 -5 

AAT GTG TTT GCA GGC AGT ATG CCT GGT AAA AGT CAT TGC CAT TCT CCA 14 8 

Asn Vai Phe Ala Gly Ser Met Pro Gly Lys Ser His Cys His Ser Pro 
1 5 10 

TTC TCT ATT AAC CAG GGC AGG 169 
Phe Ser lie Asn Gin Gly Arg 
15 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 29.770 

(C) IDENTIFICATION METHOD: Von Hcijnc matrix 

(D) OTHER INFORMATION: score 6.4 

seq LIVLTLHSPSCOT/AQ 



(xi) SEQUENCE DESCRIPTIOSI: SEQ ID NO: 86: 
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00 

AT JGGGGTTT- CTTTGTTGCT GC?GGG?G ATG CTA ATA GTC CTG ACT CTC CAC 52 

Met Leu He Val Leu Thr Leu Kis 

-:o 

TCG CCC TCC TGT <SAC ACT GCC CAG GAG GAG ATG GGG AGG GTG CCC ACT 100 
Set" Fro Ser Cys Asp Thr Aia Gin Glu Glu Met Giy Arg Val Pro Thr 
-5 1 5 10 

ACT CCC AAG TGC AGG TGG AAG TTA GGG CTC TCC ATG TGT TCT ?TG CTG IsS 
Thr Pro Lys Cys Arg Trp Lys Leu Giy Leu Ser Met Cys Ser Leu Leu 
15 20 25 

ACA CCT GGG I57 
Thr Pro Giy 



(2: INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 66.. 251 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.4 

seq SVLWLGALGLTIQ/AV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



.z^CTCCCAGA ATGCTGACCA AAGTGGGAGG AGCACTAGGT CTTCCCGTCA CCTCCACCTC 60 

TCTCC ATG ACC CGG CTC TGC TTA CCC AGA CCC GAA GCA CGT GAG GAT CCG 110 
Met Thr Arg Leu Cys Leu Pro Arg Pro Giu Aia Arg Giu Asp Pro 
-60 -55 -50 

ATC CCA GTT CCT CCA AGG GGC CTG GGT GCT GGG GAG GGG TCA GGT AGT 158 
lie Pro Val Pro Pro Arg Giy Leu Giy Ala Giy Giu Giy Ser Giy Ser 
-45 -40 -35 

CCA GTG CGT CCA CCT GTA TCC ACC TGG GGC CCT AGC TGG GCC CAG CTC 206 
Pro Val Arg Pro Pro Val Ser Thr Trp Giy Pro Ser Tro Aia Gin Leu 
-30 -25 -20 

CTG GAC AGT GTC CTA TGG CTG GGG GCA CTA GGA CTG ACA ATC CAG GCA 254 
Leu Asp Ser Vai Leu Tro Leu Giv Ala Leu Giy leu Thr lie Gin Aia 
-13 -lb -5 1 

GTC TTT TCC ACC ACT GGC CCA GCC CTG CTG CTG CT: CTG GTC AGC TTC 3C2 
Vai Phe Ser Thr Thr Giy Pro Ala Leu Leu Leu L'^u leu Val Ser Pr.t 
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10 15 



CTC ACC TTT GAC CTG CTC CAT AGG CCC GCA GTC ACA CTC TGC CAC AGC 350 

Leu Thr Phe Asp Leu Leu His Arg Pro Ala Vai Thr Leu Cys His Ser 

20 25 3C 

GCA AAC TTC TCA CCA GGG GCC AGA GTC AGG GGG CCG GTG AAG GTC CTG j93 

Ala Asn Phe Ser Pro Gly Ala Arg Val Arg Gly Pro Vai Lys Val Leu 

35 40 45 

GAC AGC AGG AGG CTC TAC TCC TGC AAA TGG GTA CAG TCT 4 37 

Asp Ser Arg Arg Leu Tyr Ser Cys Lys Trp Vai Gin Ser 
50 55 60 



(2) INFORMATION FOR 3EQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 237 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig^oeptide 

(B) LOCATION: 133.. 177 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.4 

seq LTCLFLSLISTYP/SC 

(xi) SEQUENCE DESCRIPTION: SEQ :D NO: 88: 



ATTTATGGTA GAGAGATATA TTTGTATTGG TTCCAGTTCC ATTGGTTTGT GAA.ATATTA.!\ 60 

TATGCCAACA CAGCCTAGCA TATTGGAGTC ACTGGAAATG CATCAGTGCT AGCCTTACAT 120 

GCCTTTCACT CT ATG GTG TTA ACC TGC CTT TTT CTA ACT CTA ATC TCC ACT 171 
Met Val Leu Thr Cys Leu Phe Leu Ser Leu lie Ser Thr 
-13 -10 -5 

TAC CCC AGC TGT ATC ACA CTT TTT CTT TCC AAA ATT CCT AAT CCT CTC 219 
Tyr Pro Ser Cys lie Thr Leu Phe Leu Ser Lys He Pro Asn Pro Leu 
15 10 

TCT TCA CTC CCC TCA CTG 237 
Ser Ser Leu Pro Ser Leu 
15 20 



'•2) INFORMATION FOR SEQ ID NO:- 89: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 281 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(0) TOPOLOGY: LINEAR 

(li) MOLECULE TYPE: CDNA 

(^'i} CRIGIMAL SOURCE: 

:A) ORGANISM: Homo Sapiens 
{?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 171., 224 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.3 

seq FSFSLQLLSSS3T/NP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



ATCTGTCTCT TGTTTATTAA GATATGCACA GTTTCTGAAT CA.ACAAA7AT ATCTGTGATT 60 

CTTTTATACT ACTACATAAA AGAACAGGGR GTAATTCTTG CCTTATAAAT TAAATGTCAA 120 

ACATTTCCTA TATGTAATCA TTTGTTCCTA AAATATGATT TAGTCCCAGC ATG CTT 176 

Met Leu 

ATC CCT GTT TTC TCT TT? TCT CTC CAG CTC CTA TCT AGT TCT TCA ACA 224 
lie Pro Val Phe Ser Phe Ser Leu Gin Leu Leu Ser Ser Ser Ser Thr 
-15 -10 -5 

AAT CCT GTC AAC TCT ACC TTC CAA ATG CCT TTT GAA TCC AGC CAT STC 272 
Asn Pro Val Asn Ser Thr Phe Gin Met Pro Phe Giu Ser Ser His Xaa 
15 10 15 

ACC ACC AG A 281 
Thr Thr Arg 



{2J INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(iil MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/BCEY: sig^peptide 
(D) LOCATION: 13.Tl55 

(C) IDENTIFICATION METHOD: Von Haijne matrix 
(D» OTHER INFORMATION: score 6.3 

seq LLLIESVSGLLQP/RT 
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(xiJ 3£QUENCE DESCRIPTIOM: 3EQ ID NO: 90: 



PuZ^CCCGGGG G.--=lG ATG GCG GCA GCG TNT CTG ACT GGG CCC TCT GCG GGC 50 
Met* Ala Ala Ala Xaa Leu Ser Gly Pro Ser Ala Giy 
-45 -40 

TCC GCG 3CT GGG GTT CCT GGC GGG ACC GGG GGT CTC TCG GCA GTR AGC 9c 
Ser Ala Ala Giy Vai Pro Gly Gly Thr Giy Giy Leu Ser Ala Val Ser 
-35 -30 -25 -2C 

TCG GGC CCG CGG CTC CGC CTG CTG CTG CTG GAG AGT GTT TCT GGT T7G 146 
Ser Gly Pro Arg Leu Arg Leu Leu Leu Leu Glu Ser Val Ser Gly Leu 
-15 -10 -5 

CTG CAA CCT CGA ACG GGG TCT GCC GTT GCT CCG GTG CAT CCC CCA AAC 194 
Leu Gin Pro Arg Thr Giy Ser Ala Vai Ala Pro Vai His Pro Pro Asn 
1 5 10 

CGC TCG GCA AGG 206 
Arg Ser Ala Arg 
15 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 base pairs 

(3) TYPE; NUCLEIC ACID 

(C) STRANDEDNE5S: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hono Sapiens 
{:) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/ECEY: sig_pep-ide 
(3) LOCATION: 7 8. . 122 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.2 

seq NWLFLr/FTFCNC/FF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 



CATCTTGTAC ATCTGTKAGC ATGTATCTGT GAACATATCC AT.JVGGCTGGA TACCT.aiGC.i^G 60 

GTCAAAATGA CG7GTGC ATG CAT AAT TGG CTT TTT TTG TTT GTW TTT ACT 110 
Met His Asn Trp Leu Phe Leu Phe Val Phe Thr 
-15 -10 -5 

TTT TGT AAC TGC TTT TTT AAA AAT AAT GGC 140 
Phe Cys Asn Cys Phe Phe Lys Asn Asn Gly 
1 S 
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(2) INFORMATION FOR 3EQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) L'ENGTH: 352 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STR.2iNDEDNESS: CX)LIBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Horoo Sapiens 
(r) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 24 57.295 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.2 

seq CFYFLSTALGSQA/OS 

(xil SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



ACTACCAATG GAAAATGCAG CTCTTGAGGA TGACGATTGC CAAACAAAGG CTCGGAGACG 60 

AAGCAATCGG CGTGCGACAC TTTGCAGCCC ATGAGCGTGA AGAC7TGGTG CAGCAGCTAG 120 

AGCGAGCTAA GGAACAGGTT CTCACTAACA TCTATTCAGA GTGGGGGATG CATTTGCACA 180 

GCTGGACACA ACACAAACAA GAG7GGACTG TGCCCCTCGT TTCTCAGAGT ATGGGGTGCC 240 

TGGG ATG CAC GTT GAA TGC TTT TAC TTC CTC AGC ACT GCA CTA GGG TCC 289 
Met His Val Glu Cys Phe Tyr Phe Leu Ser Thr Aia Leu Gly Ser 
-15 -10 -5 

CAA GCT GAC TCT TGG GTT TCT GGC CTC CAG CAG GCA GGT CTG CTC OCT 337 
Gin Ala Asp Ser Trp Vai Ser Giy Leu Gin Gin Aia Gly Leu Leu Pro 
1 5 10 

GCT ATT GGG TAC CGG 352 
Ala lie Gly Tyr Arg 
15 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANOEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOI.ECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 
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(ix) FEATURE: 

"*{A) MAMS/KSY: siG_peotide 
(3) LOCATION; 177.. 233 

(C) IDENTIFICATION METHOD: Von Heijne natri.x 

(D) OTHER INFORMATION: score 6.1 

seq LALLWSLPA3DLG/RS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



ATAAGTGAAC CAGACCACCC TGATGGCATC CACAGTGATG TCAAGGTTGG GGCTGGCCAG 60 

GGGTGGGTGG ACTAGAAGCA TTTGGGAGTA GTGGCCAGGG GCCCTGGACG CT.^^GCCACGG 12 D 

AGCTGCTGCA CAGAGCCTGG TGTCCACAAG CTTCCAGGTT GGGGTTGGAG CCTGGG ATG 17 9 

Met 

AGC CCC GGC AGO GCC TTG GCC CTT CTG TGG TCC CTG CCA GCC TCT GAC 227 
Ser Pro Gly Ser Ala Leu Ala Leu Leu Trp Ser Leu Pro Ala Ser Asp 
-15 -10 -5 

CTG GGC CGG TCA GTC ATT GCT GGA CTC TGG CCA CAC ACT GGC GTT CTC 27 5 
Leu Gly Arg Ser Val He Ala Gly Leu Trp Pro His Thr Glv Vai Leu 
1 5 10 ' 

ATC CAC TTG GAA ACA AGC CAG TCT TTT CTG CAA GGT CAG TTG ACC AAG 323 
lie His Leu Giu Thr Ser Gin Ser Phe Leu Gin Gly Gin Leu Thr Lys 
15 20 25 30 

AGC ATA TTT CCC CTC TGT TGT ACA TCG TTG 353 ' 

Ser lie Phe Pro Leu Cys Cys Thr Ser Leu 
35 40 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 1807.218 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.1 

seq MALALGSIPSSIA/SS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 



ACAAGGTC.fiiT GAGGCTGAAT TAGGCTTTGT GTGCTCCTTC TAGGTTCTCA GC:;':GTGTCT 60 
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GGTCCTTTAC TTGCTTTAGA TCTCTGCCCC AGCCACT3TA GGCAGG.AACA GCTCTCTTCC 120 

TTGAG.V^CTC AAGAGGTTCT CAAGGTAGTA AACTTCATGG TGCTCTTAGT TTAGTCTGA 179 

ATG GCC TTG GCC TTG GGG TCC ATC CCA AGT TCC ATA GCC AGC AGT TGG 22 7 
Met Ala Leu Ala Leu Gly Ser lie Pro Ser 3er lie Ala Ser S^r Tro 
-10 -5 1 . " 

3TC CAT GTC TCA CAT TTT TGT GCC TGT CTC CTC CAC ACA ACA TTG CCA 27 5 
Val His Val Ser His Phe Cvs Pro Cys Leu Leu His Thr Thr Leu ^ro 

CAG TCC ACC CCG AAG 290 
Gin Ser Thr Pro Lys 
20 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 31.. 78 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.1 

seq FLFCTLFSLVVKP/SH 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



AATAGTTCAC ATATTTATGT TTTTCCACAA ATG CTA GCA TTT TTG TTC TGC ACT 54 

Met Leu Aia Phe Leu Phe Cys Thr 
-15 -10 

CTG TTT TCT TTA GTA GTG CAT CCT TCA CAC ATA GAT TTA AAA TGC TCA 102 
Leu Phe Ser Leu Val Val His Pro Ser His lie Aso Leu Lys Cvs Ser 
-5 1 * 5 ' 

TTT TAT iOS 
Phe Tyr 
10 



(2) INFO.RMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHAR.-.CTERI ST I CS : 

(A) LENGTH: 349 base oairs 
(3) TYPE: M'JCLEIC ACID 
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tC) STE^ANDEDNESS : DOUBLE 
(D) TOPOLOGY: LINE,:^ 

(ii: MOLECULE TYPE: CDNA 

iviJ ORIGINAL SOURCE: 

(A; ORGANISM: Hcrao Sapiens 
{:: TISSUE TYPE: Spleen 

FEATURE: 
(A) NAME/KEY: sig^oeptids 
(3) LOCATION: 32.. 139 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
iO) OTHER INFORMATION: score 6 

seq LLYTLQTISSLSG/CF 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



AGGTGCAGGG GAGGTAAGGT GGGAGCAGGT C ATG GCT CAA ATG CCA CTG ACA 52 

Met Ala Gin Met Pro -Leu Thr 
-35 -30 

GGC TCT 7AC C-.A GAT TTA GAA TAT TTT C7T GAA TGC ATG TTT CTC CAT 100 
Gly Ser Tyr Gin Asp Leu GIu Tyr Phe Leu Giu Cys Met Phe Leu H^s 
-25 -20 -15 

TTA TTA TAT ACT CTT CAA ACA ATT TCC ACT TTA ACT GGT TGT TTT AAA 14 S 
Leu Leu Tyr Thr Leu Gin Thr He Ser Ser Leu Ser Gly Cys Phe Lys 
-10 -5 1 

CAA TTT TTT TTC CAG TTA PAT TGT TTT TGT TGG GGA GAA ATT CTA TGG 196 
Gin Phe Phe Phe Gin Leu Asn Cys Phe Cys Trp Gly Glu He Leu Trp 
5 10 15 

CAC TCT TCA TTC CTC CAT TCT GGA AGT TGT CTC TTG GTT TTG CTC ATT 24 4 
His Ser Ser Phe Leu His Ser Gly Ser Cys Leu Leu Vai Leu L*u He 
20 25 30 35 

AAA PAA APA AAG ATA TAT CTT CAA TCT CYC TWA ATC TAT ACA GG7 TAC 292 
Lys Lys Lys Lys He Tyr Leu Gin Ser Xaa Xaa He Tyr Thr Gly Tyr 
40 50 

TTW ATA GAT YCT WAA YCT TTA SGT YCC TT: TCC ATC CCT TTA AGT TTC 
Xaa He Asp Xaa Xaa Xaa Leu Xaa Xaa Pns Ser He Pro Leu Ser Phe 
55 60 65 



ATA CAG TTT 
He Gin Phe 
70 



340 



349 



(2) INFOPa^lATION FOR SEQ ID NO: 97: 

(i. SZQ'jZHCZ CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 
;3) TYPE: NUCLEIC ACID 
(C) STRANOEDNESS : DOUBLE 
:0> TOPOLOGY: LINEAR 



wo 99/06549 



74 



PCT/IB98A)1231 



■liiJ. iVOLtCULE TYPE: CDNA 

(VL) CRIGINAL SO'JRC?:: 

(A) ORGAMZSM: Homo Sapiens 
(?) TISSUE TYPS: Ovary 

(ix: FnlATURE: 

(A) NAME /KEY: sig^pepcide 
(3) LOCATION: 91.. 135 

iC) IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 6 

seq LLMGLWVRTVLOG/KE 

(xi: SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

A.ATAAAGCAT ACAGAAACCC ACCTAAAATA GACTCAGGGA GGTAGGAGGT TTCCTAAGGG 60 

CTGAGACTGA AAGATAATAG GGATTGCTTG ATG GCA TTG TTG ATG GGG CTG TGG 11^ 

Met Ala Leu Leu Met Giy Leu Trp 
-15 >10 

GTG AGA ACA GTG CTC CAG GGA AAA GAG GCC AGC GGG 150 
Val Arg Thr Val Leu Gin Gly Lvs Glu Ala Ser Gly 
-5 1 5 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 
. (D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 100.. 156 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6 

seq LAILIXSLKLTIG/IQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



ATTAAAGT7G GAGAGAGATT AGAGGCAGAA TTAACAGAAA GGAGATGTGA GAATCCAGTA 50 

GTCATTTAi.T TTTAAAAAAC AGGTATTCAA TAAAATTTT ATG ATT AAC CAT TTA 114 

Met He Asr. His Leu 



- 1 ^ 



TAT TTG GCT ATT CTT ATT KTT TCT TTA .£u^A TTA ACA ATA GCiA ATC CkZ 162 
Tyr Leu Ala He Leu lie Xaa Ser Leu Lys Leu Thr lie Giv 11^ Gin 

-10 -3 • : 
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AAA CGT TTC GGA CCA CCG 
Lys Arg Phs Gly Pro Pro 

5 



130 



(2) INFORMATION FOR SEQ ID NO: 99: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

<VL) ORIGINAL SOURCE: 

(A) ORGANISM: Hcmo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAt-IE/KEY: sig^peptide 

(B) LOCATION: 12.. 161 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6 

seq LLYLCSFPLPGTS/LF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



AAAACAAAAT T ATG GGT AGG CAA GGG ACT TTA GAA ATT GAG GGC ATT CTC 50 
Met Gly Arg Gin Gly Thr Leu Glu lie Giu Gly He Leu 
-50 -45 -40 

TGT GTC ATC ACT TGG TTA GAG GCA AAT CTA GGG AAA CAA AAA GAT GAG 98 
Cys Val lie Thr Trp Leu Giu Aia Asn Leu Gly Lys Gin Lys Aso Glu 
-35 -30 -25 

AAT CAC TAG TAT AAG AAA TTA TCC CTT TTA TAC CTT TGC T::A TTT CCA 14 6 
Asn His Tyr Tyr Lys Lys Leu Ser Leu Leu Tyr Leu Cys Ser Phe Pro 
-20 -15 -10 

CTG CCT GGA ACG TCC CTT TTT CTT CTC TGC TCT TTC TCA TAT CTT ACT 194 
Leu Pro Giy Thr Ser Leu Phe Leu Leu Cvs Ser Phe Ser Tyr Leu Thr 
-5 1 5 10 

CAA AGA CTT TCC CAA GGT GGA GGG 218 
Gin Arg Leu Ser Gin Giy Giy Giy 
15 



(2) INFORMATION FOR SEQ ID NO: 100: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 



wo 99/06549 



76 



PCTAB98/0t231 



MOLECULE T^PHI: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSCE TYPE: Testis 

(ix; FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 173.. 289 

(C) IDEMTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: score 5.9 

seq SA>/WCVL1EWSQG/A3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



ACTGGGGAAA TTGAGCCTAA GAGAACAGAA AGTACTTGAG GTCCCACAAT GAATCT.^TGG 50 

ATGAATGAGT GCTTATTCAT TCACTCATTT TTTAAAAAAA TCCAT7CCAC A^GTATGTCT 120 

TAATCAC7GC AGTGTAAGGC ACATAGGGAC AAAATAGAAG ATTCCTGTCC TC ATG GAA 178 

Met Glu 

CTC ACA A.AC AAG CAA ACA GGA AC? GAC AGA CAT GAA CAG GTA CTA CGG 226 
Leu Thr Asn Lys Gin Thr Giy Thr Asp Arg His Glu Gin Val Leu Arg 
-35 -30 -25 

AGG GTA AAG CAA GAC AAG AGG ATA AGT GCA TGG TGG TOO GTT T7A CTG 274 
Arg Vai Lys Gin Asp Lys Arg lie Ser Ala Trp Trp Cys Val Leu Leu 
-20 -15 -10 

GAG TGG TCA CAG GGG GCC TCT CTG AGG AGG CAA CAT CGA GGG GAG ACA 322 
Glu Trp Ser Gin Gly Ala Ser Leu Arg hrq Gin His Arg Glv Glu Thr 
-5 1 5 ■ 10 

AGC CCC .A-^A TCT GGG GA.A AGA CTT TCC AGG CAG AGA GAA CAG CAA AAA 370 
Ser Pro Lys Ser Gly Giu Arg Leu Ser .Arg Gin Arg Glu Gin Gin Lys 
15 20 25 

CCG CAG ATG AGT GAC AAG AGC CTG 394 
Pro Gin Met Ser Aso Lvs Ser Leu 
'30 * * 35 



(2) INFOS^TION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL 30URCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(i:<) FEATURE: 

; A ) MAXE / KEY : sig^peotide 
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(B) LOCATION: 4.. 69 

(C) IDEN'TIFICATION iMETfiGD: Von neijne mamx 
CD) OTHER INrCRMATION: score 5.9 

seq VLGLLFSISDTWA/PA 

(si) SEQUEN-Ci DESCRIPTION: SEC ID NO: 101: 

ACA ATG GCC AAG CGI CAA AAT CCT ACG TCA GTG CTA GGA CTG GTT TTT 4 3 

M^t Ala Lys Arg Glr. Asn Pro T.^r 3er Vai Leu Gly Leu Leu Ph* 
-20 -15 -10 

TCT ATA TCA GAC ACG TGG GCT CCT GCT GTG TCT TCC TGG AAA GCA GAG 96 
Ser lie Ser Asp Thr Trp Ala Pro Ala Vai 3er Ser ?rp Lys Ala Giu 
-5 1 5 

GCC AAG GAT GGA GCA GAC CAA GAG GAT GCC AGG WWA WAA TCA CAA AGA 14 4 
Ala Lys Asp Gly Ala Asp Gin Glu Asp Ala Arg Xaa Xaa Ser Gin Arg 

15 20 25 

AGC CCA GAW AGC ACA GC? GGA AGC CAA GAA CCC TAT TTT TGG TTT GTG 192 
Ser Pro Xaa Ser Thr Ala Gly Ser Gin Glu Pro Tyr Phe Trp Phe Vai 
30 35 * 40 

TGG GTG GAA GGT GAG GGA CGG 213 
Trp Vai Glu Gly Glu Gly Arg 
45 



(2) INrORMATION FOR SEQ ID NO: 102: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEONESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: big peptide 
(S) LOCATION: 2507.324 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: s;;oi« 5.9 

seq FCLSLQIFRVSLA/LA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 



A7AAGGCTAG TTCTATTTTG AAGCCTATGT GTTTTGTGAA ACAC.AAAAAA AAGTACAGAG 60 

AAA.GATCGCA TCCTTTTC7G GTAGGGGTTT TCAGGAAAAA GTAA.GAGTTC TGPiCTCATGT 120 

rCGGATTTCT TGGGCCGTTA TTCTGCAGTG GTCAAAATGG GGG-A-AGCATG TCTG7AAAAG 180 

TGTTACTGA7 ATGAC7AACA CTAACTGATC TACTTTCAJ^A CA77ACCTTT 77CC7CTCCC 2^0 
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TCCCTGTTT ATG AAT GTT TTG CCC TTC TCT TAG TAT TAT AT:: TT: TTT TGT 291 
Mer Asn Vai Lea Pro Phe S^r Tyr Tyr Tyr lie :,-?>.i Phe Cys 
-25 -20 ' -15 

TTG ACT TTA CAA ATT TTC AGA GTT TCC CTA GCT CTG GCA CAS ACT CAT 3.19 
Leu Ser Leu Gin He Phe Arg Vai Ser Leu Ala Leu Ala Xaa Thr His 
-10 -5 . 1 5 

GAG GTT CCT GTC TCT ACT CAT ACT AAC RAA TTG CAT 37 5 

Glu Vai Pro Vai Ser Thr His Thr Asn Xaa Leu His 
10 15 



(2) IMFORMATION FOR 3EQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDKSSS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 17.. 103 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(U) OTHER INFORMATION: score 5.9 

seq rSYISXFLSPVCG/CS 

(xi; SEQUENCE DESCRIPTION: SEQ ID NO: 103: 



ATCAAAATTC TCTTTG ATG AAA TGT TTA AAA GTG AAC CCT TTT TTA TTT CTG 52 
Met Lys Cys Leu Lys Vai Asn Pro Phe Lei: Phe Leu 
-25 -20 

GTW TTT AAT TTC TTT TCC TAC ATC AGT KGC TTT TTG TCA CCA GTA TGT 100 
Vai Phe Asn Phe Phe Ser Tyr He Ser Xaa Phe Leu Ser Pro Vai Cys 
-15 -10 -5 

GGA TGT TCT GTC TGT AAT TTA AAA CAC TGG GAG AAT GAG CTT CTA TTT 143 
Gly Cys Ser Vai Cys Asn Leu Lys Hiii Trp Glu Asn Glu Leu Leu Phe 
15 10 15 

CCT TCT CCC CAC TTT TTG CCA TAT AAA TTT TTN TTT CTT TTT 190 
Pro Ser Pro His Phe Leu Pro Tyr Lys Phe Xaa Phe Leu Phe 
20 25 



(2) IMFORM.\TIOM cOR SEQ ID NO: 104: 

;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 base pairs 
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(S; TYPE: MUCLEIC ACID 
(C; STR.\NDEDN£SS: D0-J3LH: 
(D; TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Saoieas 
{?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_pept: ide 
(8) LOCATION: 7 4., 172 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.8 

seq XLCLGMALCPRQA/TR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 



ATCTCTTGGC GTCTCAACGT TCGGATCAGC AGCTTTTTTC CATTCTCTCT CTCCACTTC7 60 

TCAGTGAGCA GCC ATG ACT TGG ACT GIG CCT GTT GTG CGG GCC AGC CAG 109 
Met Ser Trp Thr Val Fro Val Vai Arg Aia Ser Gin 
-30 -25 

AGA GTG AGC TCG GTG GGA GCG AAT KTC CTA TGC CTG GGG ATG GCC CTG 157 
Arg Vai Ser Ser Vai Giy Aia A^n Xaa Leu Cys Leu Gly Met Aia Leu 
-20 -15 -10 

TGT CCG CGT CAA GCA ACG CGC ATC CCG CTC AAC GGC ACC TGG CTC TTC 205 
Cys Pro Arg Gin Aia Thr Arg lie Pro Leu Asn Giy Thr Trp Leu Phe 
-5 1 5 10 

ACC CCC GTG AGC AAG ATG GCG 225 
Thr Pro Val Ser Lys Met Ala 
15 



(2) INFORMATION FOR SEQ ID NO: 105: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii} MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

{A} ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix} FEA.TURE: 

(A) NAME/KEY: sig_oeotide 

(B) LOCATION: 111/. 155 

(C) IDENTIFICATIOM METHOD: Von Keijns .Tatri:< 

(D) OTHER INFORMATIOJJ: score 5.3 

seq FLXLMTLTTHVHS/SA 
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(Xi> SEQUENCE DESCRIPTION: SEQ ID NC: 105: 



ATCCGATACA GAACA7GCAG TAATGTGGAC TGCCCACCAG AAGCAGGTGA TTTCCGAGCT 60 

CAGCAATGCT CA3CTCATAA TGATGTCAAG CACCATGGCC AGT7TTATGA ATG GGY 116 

Met Giy 
-15 

TTC CTG WGT C7A ATG ACC CTG ACA ACC CAT GTT CAC TCA AGT 3CC AAG lo4 
Pn.e Leu Xaa Leu Met Thr Leu Thr Thr His Vai His Ser Ser Ala Lys 
-10 .5 1 

CCA AAT GGG 173 
Pro Asn Giy 
5 



(2) rNFOW<.ATIOM FOR SEQ ID NO: 105: 



;i) SEQrJZNCS CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 
(3) TYPE: MUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 



(ix) FEATURE: 

(A) NAt'!£/KEY: sig^peptide 

(B) LOCATION: 33.. 80 

(C) IDENTIFICATION METHOD: Von Heijae matrix 
(0) OTHER INFORMATION: score 5.7 

seq RVLLLAQLFLGSG/KT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 



AAATTCTCTG GGCCTGCTTG TCATCACTCC AG ATG TTG TTT AGA GTT CTT CTG 53 

Met Leu ?he Arg Vai Leu Leu 
-15 -10 

TTA GCA C.AG CTG TTT CTA GGG TCT GGA AAA ACT CTA AGG ACC CCG 98 
Leu Aia Cln Lou Phc Leu Cly Ser Cly Lys Thr Leu Arg Thr Pro 
-5 1 5 



:2) INFORMATION FOR SEQ ID NO: 107: 

iL) SEQUENCE CHARACTERISTICS: 

:A) LENGTH: 243 base pairs 
■B- TYPE: NUCLEIC ACID 
C; 3TR.AMDEDNE3S: DOUBLE 
C: TOPOLOGY: LINEAR 
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(iit- MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix; TEATURE: 

(A) NAME/ KEY: sig_pectiGe 
(3) LOCATION: 79.. 174 

(C) IDENTIFICATION METHCD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5,7 

seq SLPLSTSAPPLRG/LR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 



AACAGTCCTG CCGGCTGGCT TGGGTGGGTG GTGGGCTGCG GGTAGGGGAG GGGATGGACC 60 

GAGTCCCGGC TTGTCGGG ATG AGG GTT CCG GAA GAT CTG GCC AGT AAG ATT 111 
Met Arg Val Pro Glu Asp Leu Ala Ser Lys He 
-30 -25 

CTA CTC CCT GGC TGT GCA CCG GGT TCC CTA CCC CTG TCT ACG TCG GCT 159 
Leu Leu Pro Gly Cys Ala Pro Giy Ser Leu Pro Leu Ser Thr Sar Ala 
-20 -15 -10 

CCG CCA CTT CGC GGC TTG AGA CTA AAA GAG CAT CCC GGC AGG GGG CC? 207 
Pro Pro Leu Arg Gly Leu Arg Leu Lys Glu His Pro Giy Arg Gly Pro 
-5 1 5 10 

TCC AGC CCC AAA GCA GCC TGT CCA GAG ACC CCC GCG 24 3 

Ser Ser Pro Lys Ala Ala Cys Pro Glu Thr Pro Ala 
15 20 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : D0U3LE 

(D) TOPOLOGY; LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapiens 
(F) Tissue TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/ECEY: sig_peptide 

(B) LOCATION: 63. .155 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.7 

seq SDLCLCQCILARA/HD 

(:'.ij SEQUENCE DESCRIPTION: SEQ ID NO: 1C2: 



ACATTTCTAA CGAGTACTGC CCAATCAT7T GGTAGAATGC CC:TT.V*CTT TCCCTTWCTT 53 
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TG A7G TTT CCT CAC AGW GAR ACT CAG GTT AAG T3T TTT TGG CA3 GGA LO" 
Met ?he Pro His Xaa Giu Thr Gin Vai Lys Cys Phe Trp Gin Giv 
-30 -25 -20 

TTA CGC AGA AGC GAT CTG TGT CTG TGT CAA TGC ATC CTA GCA AGG G?A 155 
Leu Arg Arg Ser Asp Leu Cys Leu Cys Gin Cys lie Leu Ala Arg Aia 
-15 ' -10 -5 

CAT GAT G3C GAT TTA TAC CTT TTT TTT 182 
His Asp Gly Asp Leu Tyr Leu Phe Phe 
1 5 



{2) INFORMATION FOR SEQ ID NO: 109: 

(i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: D0U3LE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 81.. 140 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.6 

seq LAVFMXLAQLVSG/NW 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 



AAAAGAAGGA CAATAAAGAT CTGTGTTCAG AGTCATACTG AATAGAGACT TCTGGACTCT 6G 

ATAGAACCCA CTGCCTCCTG ATG AAG TCC CTA CTG TIC ACC CTT GCA GTT TTT 113 

Met Lys Ser Leu Leu Phe Thr Leu Ala Val Phe 
-20 -15 -10 

ATG CKC CTG GCC CAA TTG GTC TCA GGT AAT TGG TAT GTG AAA AAG TGT 161 
Met Xaa Leu Ala Gin Leu Val Ser Gly Asn Trp Tyr Val Lys Lys Cys 
-5 1 5 

CTA AAC GNN TTT GGA ATT TGC AAG ANG AAG TGC AA-S CCT GAA GAG ATG 209 
Leu Asn Xaa Phe Gly He Cys Lys Xaa Lys Cys Lys Pro Giu Glu Met 

10 15 20 

CAT GTA AAG AAT GGT TGG SCA ATG TGC GGC AP.a. C.A.^. AGG GAC TGC TGT 257 
His Vai Lvs Asn Gly Trp Xaa Met Cys Gly Lys 'jIp. Arg Asp Cys Cys 
25 ' 30 35 

GTT CCA GCT AAC GGG 272 
Val Pro Ala Asn Gly 
40 
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(2) INFORf^-ATION FOR SEQ TO MO: 110: 

{1} S5:0USMC£ CHARACTERISTICS: 

(A) LENGTH: .161 base pairr, 
(3) TYPE: NUCLEIC ACID 
(C: STR.ZVNDEDNESS: DOJBLE 
(01 TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: 13. .86 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.6 

seq LLNV.ACCIPFSSS/LF 

(xil SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



ATTTTCCA.AA CATT6TG ATG CAC CTT TAT AGC TGT TCG TGT ATG CGC CTT 50 
Met His Leu Tyr Ser Cys Ser Cys Met Arg Leu 
-20 -15 

TTA AAC GTG GCA TGC TGC ATA CCC TTT TCG AGC AGC CTG TTT CCG CAC 98 
Leu Asn Val Ala Cys Cys lie Pro ?he Ser Ser Ser Lea Phe Pro His 
-10 -5 1 

ATT CTT TTC AAG TCA TTA AAC TAT TCC 'TTG kCG TCC TTT CTC AAG GCT 14 6 
lie Leu Phe Lys Ser Leu Asn Tyr Ser Leu Thr Ser Phe Leu Lys Ala 
5 10 15 20 

GTG CGT GGC CGG TGG 161 
Vai Arg Giy Arg Trp 
25 



(2) INFOSx^-lATION FOR SEQ ID NO; 111: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) rE.ATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 223.. 2*70 
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:C) IDEKTIFICATICN METHOD: Von Kei;ne .-na-riy. 
■D) OTHER INFORMATION: score 5.5 

seq PLVLSPLSYQCSS/CG 

(y.i: SEQUENCE OESCRIPTIOM: SEQ ID NO: 111: 



AATGTTT.--=.3 ATCTGTTTTA AATTTAAAAC AATGAAT7GA ATGCTCTA.=^G AGGCTCCTAC 60 

AGGCGCTCCA GGCCACTCTC AGAGACTCCC AGGAGTTGTT GA.ACTATATT TGGAGAAAAC 120 

AGCCAMTGA-A TATTATCATT TCTCCTTTAA AGAGAGTTTG TAAGGG3GGA ACATGCATTT 180 

TATCAGACAA TTTATCCAAA GCATTTCAGA ACATGAGTGC TG ATG AGG GCA OCT 234 

Met Arq Ala ?ro 
-15 

CTT GTG CTG AGT CCC CTC AGC" TAT CAG TG? TCT TCT CAA GGA CAC ATT 282 
Leu Val Leu 3er Pro Leu Ser Tyr Gin Cys Ser Ser Gin Glv His lie 
-10 -5 1 

TGG 235 
Trp 



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHAR.ACTERISTICS : 

(A) LENGTH: 262 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(iij MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY:' sig peptide 

(B) LOCATION: 1467.253 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.5 

seq FTSMCILFHCLLS/FQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 112: 



AACTTGGG.-.C 


.-.AGAR/^.TCAA ACTTTAAAGA TGGTCT.JL^.-.G 


CCCCTCTTAA AGGTCTGACT 


60 


GTGTCGGACC 


7CTAGAGCTA ATCTCACTAG ATGTGAGCCA 


TTGTTTATAT TCTAGCCATC 


120 


CTTTCATTTC 


ATTCTAGAAG ACCCC ATG CAA GTT CCC 


CAC CTA AGG CTC TGG 


172 




Met Gin Val Pro 


His Leu Arg Val Trp 






-35 


-30 




A C«-io - - 


:- A'/.'A GAT ACC T7C ATT GGT TAT A'.- A 


AAT TT3 GGA T7T ACA 


220 



"r.r Gin Vil Xaa Asp Thr Phs lie Gly Tyr Asn Leu Gly ?he Thr 
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-23 -20 -15 

AGT ATG TGC ATA TTG ?TC CAC TGT CTT CTT AGC TTT GAG AGG 262 
Ser Me:; Cys lie Leu Phe His Cys Leu Leu Ser Phe Gin Arg 
-IC - -5 1 



(2) INFORi^ATION TOR SEQ ID NC: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS; DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 46.7153 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.4 

seq LWLMHQSFQKSNS/SS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



ACTACGAATG CAGATGTGGA AACAACTTCT GTGCATCTCA TCGTT ATG CAG AAA CTC 57 

Met Gin Lvs Leu 
-35 

ATG GCT GTA CCT ATG ATT ACA AGA GCG CAG GGA GGA GAT ACT TGC ACG 105 
Met Ala Val Pro Met lie Thr Arg Ala Gin Gly Gly Asp Thr Cys Tnr 
-30 -25 ' -20 

AGG CAA ATC CTG TGG TTA ATG CAC CAA AGC TTC CAA AAA TCT AAC TCT 153 
Arg Gin lie Leu Trp Leu Met His Gin Ser Phe Gin Lys Ser Asn Ser 
-15 -10 -5 

TCC TCT ACA TCT TAG TGT TCT GCC CAG GGG 183 
Ser Ser Thr Ser Tyr Cys Ser Ala Gin Giy 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 base pairs 

(B) TYPE: NUCLEIC ACID 
{C) STRANDEDNES3: DOUBLE 
(D) TOPOLOGY: LINEAR 

(iii MOLECULE TYPE: CDNA 



(vi} ORIGINAL SOURCE 
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(A) ORGANISM: Homo Sapiens 
I'D tissue: T^pr: cvary 

:ix) F£ATUR£: 

(A) EIAME/KEY: sig^peptide 
(3) LOCATION: i . . 1*35 

iC) IDENTinCATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score S.A 

seq AKRSLCLWPACLC/AR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 



ATG TGI RTA GCT GGG TTT WAT GAC CAC 
Met Cys Xaa Ala Gly Phe Xaa Asp His 
--45 -40 

CGC ACG TCC CGC CAC CCC CTC OCT TGG 
Arg Thr Ser Arg His Pro Leu Pro Trp 
-25 

GCA CAC CGT TCC CTA TGT CTG TGG CCC 
Aia His Arg Sar Leu Cys Leu Trp Pro 
-10 «5 

CTC CCC CCA GCG CCA GGN 
Leu Pro Pro Ala Pro Gly 



CCT CGT GCG GCC CGG CAC GCC 4 8 

Pro Arg Ala Ala Arg His Aia 
-35 -30 

GTG TGT GTC TCT CAG CYC CCT 96 
Val Cys Vai Ser Gin Xaa Pro 
-20 -15 

GCG TGC CT3 TGT GCG CGT GTG 14 4 
Ala Cys Leu Cys Ala Arg Val 



162 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 127 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 62.7*115 

(C) IDENTIFICATION METHOD: Vcr. ll-ijne matrix 
(0) OTHER INFORM-ATION: score 5.4 

seq ILVSFILAALSLS/TT 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: ii5: 



ATCGGACTGA ACGGATCGCT GCGAGGATTA TCTTACACTG AACTGATCAA GTACTTTGAA 60 

A ATG ACT TCG AAA TTT ATC TTG GTG TCC 77C ATA CTT GCT GCA CTG ACT 109 

M^: Thr Ser Lys Phe lie Leu Val Ser P.-^e lie leu Ala Aia Leu Ser 
•15 -10 -5 
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CTT TCA ACC ACC ATA GGG 
Leu Ser Tnr Thr.lie Gly 



(2; IN-rORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHAR.ACrERISTIC3: 

(A) LENGTH: 332 oase pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDECNESS: DOUBLE 
<D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptid9 
(3) LOCATIOM: 2"'9..323 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.4 

seq LLIFILTVHHTPS/LP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



ATCTTTTGTT TGAGTATCTT CAAGAAAAAT CTGTTGTGAG AAAGATCCTA AACATATGTA 60 

TGTATAGATG CATATCTTTG AAAGCCTA7G TGAATACCAA GGGAATCTGA ACTTTTTCTT 120 

TGGAGATGTT TACATAATAA ATCTATTTTC ATCAATCTGG CATATTTTTC TCCTAGCACT 180 

GACTTACTGA ATGCCGCTGA CCACGTGCTG CCTCTCATGC TAAATGCTTA CTT.AATTCAT 240 

CACCA.aj;TTC TGTAGACTGT ACAGGCTAAA CACCTCTA ATG CAT TTA CTT ATT TTC 296 

Met His Leu Leu lie Phe 
-15 -10 

ATC CTC ACT GTC CAT CAC ACT CCC TCC CTC CCC TCG 332 
lie Lsu Thr Vai His His Thr Pro Ser Leu Pro Ser 
-5 1 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{F) TISSUE TYPE: Ovary 
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•ix; FEATURE: 

(A) NAME/KEi: sig_pept:ide 

(B) LOCATION: 129.. 175 

(C) IDENTIFICATION METHOD: Von Hsi^ne matrix 

(D) OTHER INFORKATION: score 5,3 

seq S3LMVQLI3QVV3/OJ 

i:<i) SEQUENCE DESCRIPTION: SEQ ID NO: i:?: 



PiCAGGAAGTT TGCCTAGAAG GAATAAATTA 

GA.\TGT.^ATT TGTTTTAGGT GGTGGTAATT 

TGTCAGAT ATG CTG TCA TCC TCA TTA 
Met Leu Ser Ser Ser Leu 
-15 

TAT ACT TGT ATG AGG AGG 
Tyr 3er Cys Met Arg Arg 



ACTCTTGTTA CTTGGTGAGA TCATGGAAGG 60 

GTGAGTTTGA GGCTGGCCCA GGAAATGAGT 120 

ATG GTT CAG CTT ATT TCT CAG GTT 170 
Met Val Gin Leu lie Ser Gin Val 
-10 -5 

199 



(2 J INFORMATION FOR SEQ ID NO: 118: 

■i) SEQUENCt; CHARACTERISTICS: 

(A) LENGTH: 14 6 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Saoiens 
(F) TISSUE TYPE: Testis 

(i:<) FEATURE: 

(A) NAME/KEY: sig^peotida 

(B) LOCATION: 57.. 98* 

(C) IDENTIFICATION METHOD: Vor. Heijne matrix 

(D) OTHER INFORMATION: score 5.3 

seq rsVILCMLFCLFS/QD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



ACTAa.TCCCC TATTTAGGTT GTTTACTTTT AGGTATTCTG CATAGAGCTG TGATGG ATG 59 

Met 

TTC TCA TAT ATA CTT TGC ATG CTT TTC TGC TTA TTT TCT CAG GAT AAA 107 
?he Ser Tyr lie Leu Cys Met Leu Pre Cys leu ?he Ser Gin Asp Lys 
-10 -S 1 

TTT CTG G.CiA GTG ACA TTG TTG TGT G.^J-. S-.ZZ TAT ATG CTT 
?hs Leu Glu Val Thr Leu Lau Cys Glu ;.r:;i Tvr Met Leu 
5 10 ■ ■ 15 
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(2- INrORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LpJGTH: 14 5 base pairs 
{B} TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LIMEAR 

iii) MOLECULE TYPE: CONA 

tvi) ORIGINAL SOURCE: 

(A) ORGANIStM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 11.. 67 

(C) IDENTIFICATION METHOD: Von Heijne niarrix 

(D) OTHER INFORMATION: score 5.2 

seq VTLAFSLLVLSES/AV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



AYCWTCTTAA ATG TTA TTT TTA TAT TAT GTT ACA CTT GCA TTC TCT TTA 4 9 

Met Leu Phe Leu Tyr Tyr Val Thr Leu Ala Phe Ser Leu 
-15 -10 

TTG GTG TTA TCA GAG TCA GCA GTA CTG AAA AGA AGA GAA ATC TTT TGR 97 
Leu Val Leu Ser Giu Ser Ala Val Leu Lys Arg Arg Glu lie P.^.e Kaa 
-5 15 10 

ACA GGG TTA GGT TGT GTG ACA GGG TTA GGT TGT GTG ACA GGG TTA CGG 14 5 
Thr Gly Leu Gly Cys Val Thr Gly Leu Giy Cys Val Thr Gly Leu Arg 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

iix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 1437.184 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORt^ATION: score 5.2 

seq LLSGLWLSSVKEC/DD 

SZQUZtiCZ DESCRIPTION: SZZ ID I'.O: 120: 
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AAGGAGTAGT GGCTTTGTTC CCAGCTCAGT GAAGGGTGGC ATGGTCTCTC CTG7CCACT? 60 

CACTCTGGA? TCTTT^JVCGC TGTGAAT7AC TAGACATGGA TTCCATCTCC AATGTGGATG 12C 

CCTCTCTTCA CCACAAGA.-.T AC ATG CTC CTT TCT GGG C7G TGG CTT AGC TCG 172 

Met Leu Leu Ser Gly Leu Trp Leu Ser Ser 
-10 -5 

GTC AAG GAG T3T GAT GAG TGG CGA GCA GAT GGC TGC CTT CCA TCC ATC 220 
Val Lys Glu Cys Asp Asp Trp Arg Ala Asp Gly Cys :.eu Pro Ser lie 
1 5 10 

GTC CJKC CCC CTA AGG 2 35 

Val Mis Pro Leu Arc 
15 



(2) INFORMATION FOR SZQ ID NO: 121: 

(i) SZQUENCS CH.\RACTERISTICS: 

(A) LENGTH: 131 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRAT^DEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^pepcide 
(3) LOCATION: 59.. 112 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.2 

seq VFCFSWLMSSSSP/SI 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 



ATACAAAGGA AATTAGTATG TTCCTTGAGG TTCAGGGAAT CTATGTATAT TTCAGATC 58 

ATG GTT GCA TTT TCA GTC TTC TGT TTT TCA TGG TTG ATG AGT TCA TCA 106 
Met Val Ala Phe Ser Val Phe Cys ?he Ser Trp Leu Met Ser Ser Ser 
-15 -10 -5 

AGT CCT TCC ATC TTT TGG AGT CAT TTC TAT TCA CCA TTC AAG GAT CTA 154 
Ser Pro Ser lie Phe Trp Ser His ?h«? Tyr Ser Pro Phe Lys Asp Leu 
15 10 * 

TCT AAA ATG TAT AAT TAT GTC TCC CCG 131 
Ser Lys Met Tyr Asn Tyr Val Ser Pro 
15 20 



(25 I>:F0?>LAT::M for seq id NC: 122: 
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{i) SEQUENCH: CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) 3TRAKDEDNESS : DOUBLE 
CD; TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Komo Sapiens 
(?) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 123.^170 

•:C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.1 

seq LALGIGPPGCLQG/SP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 



ATGTTTTCTT AAGATCCAGA AGTTTTTGCT 7TAGCTTAAG GATGTGTGCA AT7TTCCATG 60 

TGGCTTCATA ATTCATCCAT GACTTTGAAT TTTAAAATGG AGAGAAGTTG GCTTCCCAGG 120 

AA ATG GTG CCC CTG GCC CTG GGC ATC GGC CCA CCT GGC TGT CTC CAA 167 
Met Vai Pro Leu Ala Leu Giy lie Gly Pro Pro Giy Cys Leu Gin 
-15 -10 -5 

GGC TCT CCT TCC CAG TGG CTG GTG CGG GCT CCG GGA GCT CAG CTG AGT 215 
Gly Ser Pro Ser Gin Trp Leu Val Arg Ala Pro Gly Ala Gin Leu Ser 
- 5 10 15 

CCC ATT GGG GTG GCA ACG GAA AGG GAG CAG AGG 24 8 

Pro lie Gly Val Ala Thr Glu Arg Glu Gin Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 64.Tts'<? 

CO IDENTIFICATION MBTHOD: Von Heijne matrix 
CD) OTHER INFORMATION: score 5.1.' 

seq LLWFCTAMRPGGA/GL 
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(xi) SEQUENCE DESCRIPTION: 3EQ ID NO: 123: 



AGGATTAAGC AAGCACAGCC CTAGTTGATC ACCCAGCATG AAA.AGTCCTG GP.-i'.TCTCTCA aO 

GAG ATG A-AG CTG TGT ATG GGA GTT T7G CTT AAA GTK GGT ACT TCA AGA 105 

Met Asn -eu Cys Wet Gly Val Leu Leu Lys Vai Gly Thr Ser Arg 
-30 -25 -20 

AGG TGC CTC TGT TTA CTT TGG TTT TGC ACT GCC ATG CGA CCA GGT GGT 156 

Arg Cys Leu Cys Leu Leu Trp Phe Cvs Thr Ala Met Arg Pro Gly Gly 
-15 -10 ' -5 

GCA GGT CTC CCA AAT GCC ACC CCC GAA TGG 13 5 

Ala Giy Leu Pro Asn Aia Thr Pro Glu Trp 
1 5 



(2) IMFOE^J^TION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEONESS : DOUBLE 
(DJ TOPOLOGY: LINEAR 

(ii) MOLECOLS TYPE: CDNA 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 112,. 153 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.1 

seq SLAKSLFLRVARG/LG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 



ATTTTTAAGG GAAAGACCCG GAAACAGCAC ATTCTCTTTT TCCAGTAGCC GGAATTTGCA 60 

ACTACATATA GTCGCAAAGA AGACTGGGAG GWWATCTTTA GTTGGGAAGC A ATG ACT 117 

Met Ser 

CTA GCA AAA TCT CTG TTT TTA AGG GTG GCA AGG GGA CTG GGG 159 
Leu Ala Lys Ser Leu Phe Leu Arg Val Ala Arg Glv Leu Gly 
-10 -5 " 1 



(2) INFORMATION FOR SEQ ID NO: 125: 

(1) SEQUENCE CHARACTJvRr^J-^Tr.S: 

(A) LENGTH:. 342 bas-» pairs 
•3) TYPE: NUCLEIC ACXU 
\Z\ STRAMDEDNESS: DCUBLE 
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CD) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

{ix: FEATURE: 

(A) NAME/KEY: sig__pepcide 

(B) LOCATION: 61. . 114 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 5.1 

seq FLPSATLLL3AES/rr 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



AAGGGCTCTG CCTCTTCCCT ATACCATGCT GTCTTCCATA GCCTTCCTCC TGTCCTACTC 60 

ATG AGA CTG CCT CCA TTT CTT CCT TCT GCA ACC CTG CTC CTA TCA GCT 108 
Mec Arg Leu Pro Pro Phe Leu Pro Ser Ala Thr Leu Leu Leu Ser Ala 
-15 -10 -5 

GAA TCC TTC TTT CGG AGT GT7 AGT GAG TAG CCG TCT CTC CCC AGO CCC 156 
Giu Ser Pile Phe Arg Ser Val Ser Giu Tyr Pro Ser Leu Pro Ser Pro 
1 5 10 

TCA GCT GGT GGG CCT GGG TGT GTC AGC GGC APJK TGG GGC TCT GGT TCC 204 
Ser Ala Gly Giy Pro Gly Cys Vai Ser Giy Lys Trp Gly Ser Gly Ser 
15 20 25 30 

AAT GGG CCA CTC TCA TCT CTC TCT TGT TCC TTG TGC AGA AAA CCT TTG 252 
Asn Giy Pro Leu Ser Ser Leu Ser Cys Ser Leu Cys Arg Lys Pro Leu 
35 40 45 

CTT CAC TCC ACT GCC CTC TCT AGT TCC CGA CCC TTT TTC TCT CCT GGC 300 
Leu His Ser Thr Ala Leu Ser Ser Ser Arg Pro Phe Phe Ser Pro Giy 
50 55 60 

TTT CCC TGC CAA ATT TCT CCA AGG AG? GGT CTA CAY CCT CTG 342 
Phe Pro Cys Gin lie Ser Pro Arg Ser Giy Leu His Pro Leu 
65 70 75 



(2) INFORHATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 
<3) TYPE: NUCLEIC ACID 
iC) STRANDEDNESS : DOUBLE 
(D) TOPOLOGY: LINEAR 

(ii: MOLECULE TYPE: CDMA 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Horto Scipiens 
(r) TISSUE TYPE: Ovary 

iiii] FEATURE: 
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.-•A) NAME/KEY: sig^pertide 
(5) LOCATION: 202.*. 34 9 

(C) IDENTIFICATION METHOD: Von Hfti jnf» matrix 

(D) OTHER INFORMATION: score 5 

seq PLLLLLREELVTG/AV 

;xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 



ATATTTAGT? CCTTTATTTT 7TTCTTTTCA AATGAATGGC TTTTAA-AGTA CATGTTATGT 60 

GAAGTATTCA C.AAACACTGG 7GCTTCCATG ATTATTGAGG AACATGTGAT TTA7AAAATG 12C 

CCTCACTGTT TTCC.VVGATA CACGATTGCG TCTGGGCACA GTTGATTTCT CCTTGCCTAC 180 

TCCCCCTCGC CCCTCACCCC C ATG AGT GAC AGA AAA AGA ACT PiPJK TTC TCA 231 

Met Ser Asp Arg Lys Arg Thr Lys Phe Ser 
-45 -40 

TAT GTC CK\ CTC CCA TGC CCA ATC TCC CTT CTC CCA CGC AGT TTT AAA 279 
Tyr Vai Glr. Leu Pro Cys Pro He Ser Leu Leu Pro Arg Ser Phe Lys 
-35 -30 -25 

AGG GGA C.--^ ATC CCA GGT CCC TCG GCT CCA CCA CTT CTT CTT CTT CTG 327 
Arg Giy Gin. He Pro Gly Pro Ser Ala Pro Pro Leu Leu Leu Leu Leu 
-20 -15 -iO 

CG? GAG GAG TTG GTT ACC GGG GCC GTG 354 
Arg Glu Giu Leu Val Thr Giy Ala Val 
-5 1 



{2) INFORilATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: s:Lg_pepticie 
(3) LOCATION: 12. . 134 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 

seq FCFFPAFLVXVX3/QP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 



ACTCTTCGTT T ATG ACT CCG TTC CCC TCC GGC CCT CCT AGA GAG GCC TCC 50 
Met Thr Pro Leu Giy Ser Gly Pro Pro Arg Giu Ala Ser 
-40 -35 -30 
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ATA GCG CAG-G7T CGT GGG TIC TCG TGG ACC TTT TTC CGT GTA GCT TTC 98 
lie Aia Gin Vai Arg Gly ?he 3er Arc Thr Fhe ?he Arc Vai Ala Phe 
-25 -20 -15 

T3C TTC TTC CCG GCA TTC CTT GTT WCG GTT TTM TCA CAG CCC 7CT GGM 14 6 
Cys Phe Phe Pre Aia Phs l.eu Vai Xaa Vai Xaa Ser Gin Pro Ser Gly 
-10 -5 

TTT TCC ACC ACT GAG ACA CTT TGC GCT CAG GAC TTC AGT GAC GTC ATC 194 
Phe Ser Thr Thr Giu Thr Leu Cys Aia Gin Asp Phe Ser Asp Vai lie 
5 10 15 20 

TTT CTG CGG CGC GCR GAC ACC CGC CGG TGG AAG AAG AAA CAG CTC CGC 24 2 
Phe Leu Arg Arg Ala Asp Thr Arg Arg Trp Lvs Lys Lys Gin Leu Arc 
25 30 35 

CGC CGG 24 8 

Arg Arg 



(2) INFOEIMP.TTON FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 base pairs 

CB) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 93.. 137 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5 

seq CSALFPLLSLLSC/KE 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NU: 128: 



ATGTGCTGAA ACTTAATCAG CAATGTGATG GTAATAGGTG GGGCCTTTAA AGGTGATTAA 60 

GTCATGTGAG TGACCTTTAT AAAJVAAGGCT TC ATG CGT TGT TCA GCT CTC TTT 113 

Met Arg Cys Ser Aia Leu Phe 
-15 -10 

CCC CTT CTA TCT CTT TTG TCA TGC .AAA GAG AGG ATR TGG TGT TTG TCC 161 
Pro Leu Leu Ser Leu Leu Ser Cys Lvs Giu Arg Xaa Trp Cys Leu Ser 
-5 *1 5 

ACA TTG GAG GAT GCA GCG ACA DGG CGT CAC CTT GGA AGT AGA GAG CAG 209 
Thr Leu Giu Asp Ala Aia Thr Xaa Arg His Leu Gly Ser Arg Giu Gin 

10 ' 15 20 



CCC TCA GGG GAT GCT GAG CCT GTG QP-A GTA TGG 
Pro Scr Cly Asp Aia Giu Pro Vai Giu Vai Trp 



242 
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[2) INtGRMa.TICN FOR SEQ ID MO: 129: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1^5 base pairs 
(3) TYPE: N'JCLEIC ACID 

(C) STRAMDE0NE33: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ill MOLECULE TYPE: CDNA 

(vii ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
[T) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 41.. 103 

{O IDENTIFICATION METHOD: Von Heijne matrix 
(D» OTHER INFORMATION: score 5 

seq IISLLKLCSrCFI/KD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



AATTTAAGAT .iLATATCCAGT TCATGTAGAC ATGAATATAT ATG CTT TAT GAT CAA 55 

Met Leu Tyr Asp Gin 
-20 

TAT TAC CTG ATA ATA TCA CTA CTA AAG CTA TGT TCT TTT TGC TTT ATT 103 

Tyr Tyr Leu lie lie Ser Leu Leu Lys Leu Cys Ser Phe Cys Phe lie 
-15 -10 -5 

AAA GAT TTT AAA GCC AGC AAC ATC ACT TTG GTA GTG ATA TTG 145 

Lys Asp Phe Lys Ala Ser Asn lie Thr Leu Val Val lie Leu 
1 " 5 10 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D> TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vij ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYfE:: UtcrUG 

(i;<) riATURE: 

[k) NAME/KEY: 2 i g__peptide 
(3) LOCATION: 71 '.265 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
;D) OTHER IMror.MATIOK: score 5 
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AaCAGACCGT GCCZkCCCCT AGATT3TTC? TAAGCTCTTT TTTGCATCTT TTACT7GCCT 50 

AGACTCTGP.i. ATG GCT AAC TGT T7C CTA TCA CAT AAG AGC CAA ACT ATT iC9 
Mec Aia Asn Cys Phe Leu Ser His Lvs Sec Gin T^ir lie 
-65 -60 -55 

CTA ATT TCA AAG CCT GCT CTG ACT CAG TCC CAT TTT ACC TCT CCA GCC 157 
Leu lie Ser Lys Pro Ala Leu Thr Gin Ser His Phe Thr Ser Pro Aia 
-50 -45 -40 

GGC TTG TTT CTA ACT GTT GAG AAA TCA CAC CTT TTG ACA AGG CTG TTT 205 
Gly Leu Phe Leu Thr Vai Glu Lys Ser His Leu Leu Thr Arg Leu Phe 
-35 -30 -25 

TTT CAC TGG CTT TCG TTA GTG CTG TGC TCG TTT CTG TCT TTG AGA TTT 253 
Phe His Trc Leu Ser Leu Vai Leu Cys Ser Phe Leu Ser Leu Arg Phe 
-20 -15 -10 -5 

TGC ACA TTA TCT TTT ATG TGC TCT TTT GCC CTT TTC CAC CTG 295 
Cys Thr Leu Ser Phe Met Cys Ser Phe Ala Leu Phe His Leu 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

vA) LENGTH: 298 base pairs 

(B) TYPE: NUCLEIC ACID 
■:C) STRANDEDNESS : DOUBLE 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/ KEY: sig_peptide 
(3) LOCATION: 20. .73 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER IKcORaATION: score 5 

seq LTYLLFLPDWAAV/FE 

(xi; SEQUENCE DESCRIPTION: SEQ ID NC: 131: 



AACGGACAGA TTTATTGGA ATG CAT GGA GCT GGT CTG ACC TAT TTA CTT TTC 52 

Mcc His Cly Aia Gly Leu Thr Tyr Leu Leu Phe 
-15 -10 



CTT CCA GAC TGG GCT GCT GTA TTT GrJ\ CTG TAC AAC TGT GAA GAT GAA 
Leu Pro Asp :r? Ala Ala Vai Phe Glu Leu Tyr Asn Cys Glu Asp Glu 
-5 15' 



too 
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CCC TG7 TAC,TTA GAC TTG GCC AGO CTG AGA GGC GTT CAC TAG ATC ACT l^r 

Ar;; Cys Tvr Leu Asp Leu Ala Arg Lea Arg Giy Val His Tyr lie Thr 

10 15 20 25 

TGG CGA CGG CAG AAC AAA GTC TTT CCT GAG GAT AAG GGC CAC CAT CCA 195 

Trp Arg Arg Gin ksn Lys Val Phe Pro Gin Asp Lys Gly His His Pre 

30 55 40 

ACC CTG GGG GAG CAC CCG AAG T?C ACC AAC TAG TCT TTC GAT GTA G.^l^. 24 4 

Thr Leu Gly Giu His Pro Lys Phe Thr Asn Tyr Ser Phe Asp Val Giu 

45 50 55 

GAA TTT ATG TAT CTT GTC CTT CAG GCT GCA GAC CAC GTA TTG C.Vi CAC 292 

Glu Phe Kec Tyr Leu Val Leu Gin Ala Ala Asp His Val Leu Gin His 

60 65 70 

CCC GGG 298 
Pro Giy 
75 



(2) INFORMATION FOR SZQ ID NO: 132: 

(i) SH:QU£NC£ CHARACTERISTICS: 

(A) LENGTH: 148 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANOEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{:) TISSUE TYPE: Ovary 

(ix) rZATURE: 

(A) NAME/KEY: sig_pepticle 
(3) LOCATION: 26. .70 

(C) IDENTIFICATION METHOD: Von Hei:jr.e matrix 

(D) OTHER INFORMATION: score 4.9 

seq C13ATLAFSGSFL/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 



ACGCCAAACT CCTCCAGCTG GCCCC ATG TGC TGC CTT TCT GCC ACG CTA GCC 52 

Met Cys Cys Leu Ser Ala Thr Leu Ala 
-15 -10 

TTT TCA GGC TCT TTT CTG GCT CCC CAC CTC ATC TTT TGC TGT TTC TCC 100 
Phe Ser Gly Ser Phe Leu Ala Pro His Leu He Phe Cys Cys P?.e Ser 
-5 1 5 10 

CAC CTG AAT GTC ATC ATC CTC CTA TCC TCA TTA TCC CCT ATC C.-.C GGG 118 
His Leu Asn Val lie He Leu Leu Ser Ser Leu Ser Pro lie Hi3 Giy 
15 20 :> 



[7.) IMrG?.y.-.TrCK for SF.Q to NO: 133: 
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(i) SEQUENCE CHAR-=.CTERISTICS : 

(A) LENGTH: 1^2 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANOEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

iii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 38.. 154 

(C> IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 4.9 

seq SGLRGLLLQEALG/AV 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 



AACAGAAGAA AGACAGCCTA GGAGCAGAGC CTCCCAG ATG GCT GAG TTG GAT CTA 55 

Met Ala Glu Leu Asp Leu 
-35 

ATG GCT CCA GGG CCA CTG CCC AGG GCC ACT GCT CAG CCC CCA GCC CCT 103 
Met Ala Pro Gly Pro Leu Pro Arg Ala Thr Ala Gin Pro Pro Ala Pro 
-30 -25 -20 

CTC AGC CCA GAC TCT GGG TTG AGG GGG CTG CTG TTG CAG GAG GCC CTG 151 
Leu Ser Pro Asp Ser Gly Leu Arg Gly Leu Leu Leu Gin Glu Ala Leu 
-15 -10 -5 

GGA GCA GTG CCG GAC CCC AGG 172 
Gly Ala val Pro Asp Pro Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CH.ARACTERISTICS : 

(A) LENGTH: 370 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(i:-:) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 2037.286 

(C) IDENTIFICATION METHOD: Von Heijne matri:-: 

(D) OTHER INFORMATION: score 4.9 

seq FLVACPLFGVCLX/Fr 
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(:<ii StQUENCE DESCRIPTION: StQ ID NO: 134: 



ACTTCAGTAA ATCTATTATT GATGTA.ATAC TTTTGTCTAA TTA3CATTCA TATTCTAATT 60 

TTGTCAGTTG TTCAATAATA TCCTTTTTGA CAATTTTTCC TCCAGTGAGG GATCAAGTC7 120 

AGGGCTGGAT A7TGTGTTTC ATTGTCATGT ATCTTGAGTC CCCTTTA.ATC TGGGAGAG7T 130 

CCTCAGCTTT GCTTTGTGTC TT ATG ACA TTA AGA CAT GGG AAT AAT ATC CTC 2 32 

Met Thr Leu Thr His Giy Asn Asn lie Leu 
-25 -20 

CAC CTC GCC AAC TTT TXT TTA G7A GCA TGT CCT TTA TTT GGG GTT TGC 280 
His Leu Ala Asn Phe Phe Leu Val Ala Cys Pro Leu Phe Gly Val Cys 
-15 -10 -5 

CTG AWR TTT TTC ATT CTT AGA TTC AGG TTA TAC ATT CAA GGC CCA AAT 328 
Leu Xaa Phe Phe lie Leu Arg Phe Arg Leu Tyr He Gin Gly Pro Asn 
1 5 10 

GTC ACA CAA GTG ATA TTG CAT CTG TCT CAG GGA ACC TTG AGC 370 
Val Thr Gin Vai He Leu His Leu Ser Gin Gly Thr Leu Ser 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hozno Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 131.. 222 

(C) IDENTIFICATION METHOD: Vor. Heijne macri.x 

(D) OTHER INFORMATION: score 4.9 

seq VIRWLPWPRGSHS/DS 

(xi) SEQUENCE DESCRIPTION: SEQ ID N'O: 135: 



AAGTATCCAG CCTCAACATT CAGCAGAGGC CCCAGATCAG CGTC7GAGCC AGGCCAACAA 60 

TGACC.AAGGA 3GATGGGATC CTGGGTGCAG CTCATCACA.A GCGTCGGG7G AGTCCGAGGC 120 

CCCAGCTCTC TGCCCTCCTG KTCCTC7GCT CTCTCCTGGT CCTCCCAG7T C7AC7GGCTC 180 

r.TG GTG TTC .AGA TGG TTG CCT TGG CCT ACG GGG TCA CAC AGC GAC TCG 223 

t<e: Val Leu Arg Tro Leu Pro 7rp Pro Arg Glv Ser His Ser k^v Ser 
-10 -5 
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•:2) INFO=MATICN FOR SEC ID MO: 136: 

{i- se:quh:nc£ characteristics: 

(A) LENGTH: 156 base pairs 

(3) TYFI: NUCLEIC ACID 

(C) STRANDEDNE3S: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ill MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Uterus 

(i.v) FEATURE: 

(A) NAME/KEY: sig_pepcide 

(B) LOCATION: 50.. 121 

(CI IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 4.3 

seq rSFLGTLFHKSNS/ED 

•Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 



ATAGTATT3A TGCTGGGTCA AACTAGTTAG GAGGATTTTC AGTTCTCCC ATG AAA GCA 58 

Met Lys Ala 

AGG CTC TCT GGT AAT CTG ATT TGT TTT TCT TTT CTA GGA ACC CTC TTT 106 
Arg Leu Ser Gly Asn Leu lie Cys Phe Ser Phe Leu Giy Thr Lau Phe 
-20 -15 -10 

CAT ?JJK TCA AAC TCA GAA GAC AGC TCT GTA GGA AAA GGA GAC TGG AAG 154 
His Lvs Ser Asn Ser Glu Aso Ser Ser Val Gly Lys Giy Aso Tro Lys 
-5 1 5 'lb 

AAG AAA AAT AAG 166 
Lys Lys Asn Lvs 

is 



12) INFORMATION FOR SEQ ID NO: 137: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 base pairs 
(9) TYPE: NUCLEIC ACID 
(C) STRANDEDNESS: DOUBLE 
(0) TOPOLOGY: LINEAR 

(ii) KOLECULE TYPE: CDNA 

ivi; ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(i:<) FEATURE: 

(A) NAt^E/KEY: sig peptide 
(3) LOCATION: 1077.154 
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..;C) IDENTiriCATIGM METHOD: Vcn Hei^ne ra'riy 
(D) OTHER IMFORMATION: score 4.8 

seq VCLVFQTPSLCr.a/Ki-; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

A.SiTGAACGAA CGG3GAAAGT GCATGTTGTA GTTCTCAAAA CZC?APAPAA TCTAAGAGPA 60 

ACCCAGCAGC AAGAAACACA GAGGTTTGGG TGTCAGCATC GGAGGA ATG TCT CAC iI5 

Met Ser His 
-15 

GTC TGC CTT GTC CCC CAG ACC CCG TCC CTG TGT CTG GGC AAA GGC ACG 153 
Vol Cys Leu Val Pro Gin Thr Pro Ser Leu Cys Leu Giv Lys Giy Thr 
-10 -5 * 1 ' 

CCC CGC TCC AGG TCG GCC CCA TTT CAG AGC AGT GGC CC? CAT AGG CTT 211 
Pro Arg Ser Arg Ser Ala Pro Phe Gin Ser Ser Glv Pro His Arg Leu 
5 10 15 



TGT GCG 
Cys Ala 
20 



217 



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2S6 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDMA 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Hono Sapiens 
(r) TISSUE TYPE: Spleen 

(ix) cZATURS: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 93.Tl79 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.8 

seq VLTSVNLFIGING/SV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



ACTGCTTCCA 3CAKAAGTCC TATGTGTCCT CCACCA^VTCT GCCTGTGCTA GCCCTTCTAC 50 

TTTTGCTGTA TGGGTGGTCA ATCACACCTC TC ATG TAC CCA GCC TCC TTT GTG 113 

Met Tyr Pro Ala Ser Phe Val 
-25 

TTC iVkG ATT CCC ACC ACA GCC TAT GTG GTG CTC ACC AGC GTG AAC CTC 151 
Pne Lys lie Pro Ser Thr Ala Tyr Val Val Leu Thr Ser Val Asn Leu 
-20 -15 -10 
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TTC ATT GGC- ATT AAT GGC AGC GTG GCC 
Phe He Gly lie Asn Glv Ser Vai A' a 

ACC GAC PlAT kJ\Q CTG AAT AAT ATC 
Thr Asp A5n Lys Leu Asn Asn lie Asn 
15 

TTG ATC TTr CCA CAT TTT TGC CTG GGA 
Leu lie Fhe Pro His Phe Cys Leu Giy 
30 35 
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ACC TTT GTG CTG GAG CTG TTC ^'OS 
Thr ??.e Val Leu GIj Leu Phe 
5 10 

GAT ATC CTG K\G TCC GTG TTC 257 
Asp lie Leu Lvs Ser Val Phe 
20 25 

CGA GGG CAG ACG 2 96 

Arg Giy Gin Thr 



(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A; ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 165.. 254 

(C) IDENTIFICATION METHOD: Von Heijne niatrix 
(0) OTHER INFORMATION: score 4.8 

seq RSSLWVTAPLVSA/CP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



AAGAGCCTCT TGSATCCCCA CAGGGYAATG GGTGTBCCGA TCTCGCGGGG G.ACTCTGTGA 60 

TCCGTGTTCC CCTGACCCTC CTAGTGCACA ACTTGGCCGG GCTCACTGGG CTCCTGCACC 120 

ACTGCCTGTC AGGTCCGCTG CCAGCCCCAA GCCCCCCACC AGCC ATG AGC TCC TCC 176 

Met Ser Ser Ser 
-30 

AGA AAG GAC CAC CTC GGC GCC ASA GCT CAG AGC CCC TCC CGG TCA TCA 224 
Arg Lys Asp His Leu Gly Ala Xaa Aia Gin Ser Pro Ser Arg Ser Ser 
-25 -20 -15 

TTG TGG GTA ACG GCC CCT CTG GTA TCT GCC TGT CCT ACC TGC TCT CCG 272 
Leu Trp Val Thr Aia Pro Leu Vai Ser Ala Cys Pro Thr Cys Ser Pro 
-iC -5 1 5 

CCT ACA C.-.C CCT ACC GGC 2?0 
Aia Thr His Pre Thr Gly 
10 



(2) INFCRI-Ji.TIDN FOR SEQ ID N'O: 1<:0: 
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:i; SZQUSMCE CHARACTERISTICS: 

{A; LENGTH; 397 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDKESS: DOUBLE 
(D; TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Spleen 

{ixl FEATURE: 

{A) NAt^E/KEY: sig^pepcice 

(B) LOCATION: 230.. 286 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.8 

seq ATYLVQSSACCPA/IV 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


140: 




ACCACGGGGA 


CAAGGACTGC 


KCCC.ACGATG 


GTGCTCCTGC 


CAVGCCCCAG CTG3ACGGGG 


60 


AGTCCTGTGG 


GGCCCAGGCC 


TTGA.ACAGCC 


ACATGCCTGC 


TGAGACCGAG GAGCTGGGAC 


120 


GGTGGGGACC 


ACAGAGAGCA 


ACCTGATTAC 


CTCCCTGCTT 


GGGCTGTGCC AGAGCAAGAA 


180 


GAGTCGGGTG 


GCCTTG.AAGG 


CCCAGGAG.AA 


CCTGCTGCTC 


CTGGTGAGC ATG GCC TCC 
Met Ala Ser 


238 



CCA GCA GCT GCC ACC TAG CTG GTA CAG AGC AGC GCC TGC TGC CCT GCG 286 
Pro Ala Aia Ala Thr Tyr Leu val Gin Ser Ser Aia Cys Cys Pro Ala 
-15 -10 -5 

ATC GTC CGG CAC CTT TGC CAG TBG TAC CGG TCC ATG CCT GTC TTC CTG 334 
lie Val Arg His Leu Cys Gin Xaa Tyr Arg Ser Met Pro Val Phe Leu 
1 5 - 10 15 

GAC CCC GCA GAS ATT GCC ACC TTA GAG GGC ATC AGC TGG AGG TTA CCC 382 
Asp Pro Ala Xaa lie Ala Thr Leu Glu Gly He Ser Trp Arg Leu Pro 
20 25 30 

AGT GCC CCG TCT GAT 397 
Ser Ala Pro Ser Asp 
35 



(2) INFORM.ATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37B base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDWES5: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii; KOLECULE TYPE: COifA 
(vi: ORIGINAL SOURCE: 
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:A) ORGANISM: Hone Sapiens 
■F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_pec-ide 

(B) LOCATION': 172.. 354 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.7 

seq LLPCNLHXSWLHS/SP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 



AGATTGGCTG GGCAGATGGG CT3ACTGGC7 GGGCAGATGG GTGGGTGAGT TCCCTCTCCC 60 

CAGAGCCATC GGCCAGGTAC CAAAGCTCAG CTGTATGGAT TCCCAACAGG AGGACCTGCG 120 

CTTCCCTGGG ACCCATTGTT GTACTGGATT AACAAGCGAC GGCGCTACGG C ATG AAT 177 

Met Asn 
-60 

GCA GCC ATC AAC ACG GGC CCT GCC CCT GCT GTC ACC AAG ACT GAG ACT 225 
Ala Ala lie Asn Thr Glv Pro Aia Pro Ala Val Thr Lys Thr Glu Thr 
-55 -50 -45 

GAG GTC CAG AAT CCA GAT GTT CTG TGG GAT TTG GAC ATC CCC GAA GCC 273 
Glu Val Gin Asn Pro Asp Val Leu Trp Asp Leu Asp He Pro Glu Ala 
-40 -35 -30 

AGG AGO CAT GCT GAC CAA GAC AGC AAC CCB NMG GCG GAA GCC CTG CTC 321 
Arg Ser His Ala Asp Gin Asp Ser Asn Pro Xaa Ala Glu Ala Leu Leu 
-25 -20 -15 

CCC TGC AAC CTG CAC TGM AGC TGG CTC CAC AGC AGC CCC AGG CCA GAT 369 
Pro Cys Asn Leu His Xaa Ser Trp Leu His Ser Ser Pro Arg Pro Asp 
-10 -5 1 5 

CCC CAT TCC 378 
Pro His Ser 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 base pairs 

(B) TYPE: NUCLEIC ACID 
CO STRANDEDNES3: DOUBLE 
(D) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(i:-:) FEATURE: 

(A) NAME /KEY: sig_pepticle 
(3) LOCATION*: 130.. 308 

.;C) IDENTIFICATION' METHOD: Von Heijne matrix 
iO) OTHER INFORMATION: score 4.7 



wo 99/06549 106 

seq GIFLVIFC3ESFS/LL 
(xi) SEQUENCE DESCRIPTICN: SEQ ID NO: 14 2: 
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AATGGAMTTC TGGGTTGACA RATGTTTTGT TGTGTTTTGT TTATWCCTCA TCTGTTCTCT 60 

A7TGTTTCTA GTTTGTAGTC AGCATTCATA GGTGTACTTG ATTCCTCCTA 7RWTATTAGN 120 

NTCTAGCTGT TTTCAGGRAT TTC7CTTTKA TT7TTGAGT7 CCAG7AGT77 GAC7A7AAT 179 

A7G ATA AAC CTA CTT GTG GGT AAC 7GC ATT TAT CTG C7T GGA GCT ATT 227 
Met lie Asn Leu Leu Val Giy Asn Cys lie Tyr Leu Leu Giy Ala lie 
-40 -35 -30 

AGA GCT 7CT TGC ATG TGT AGA TKB ATG 7CT TTC GCC AAA TTT GGG ATT 275 
Arg Aia Ser Cys Met Cys Arg Xaa Met Ser Fhe Ala Lys Phe Giy lie 
-25 -20 -15 

TTT CTT GTA ATA TTT TGT TCT GAA TCA TTT 7CT CTT CTC CTC TGG AAC 323 
Phe Leu Val He Phe Cys Ser Giu Ser Phe Ser Leu Leu Leu Trp Asn 
-10 -5 1,5 

TTC AGT TCA ATA TAT GTT AAG ACC TTT TGG CCA GTG GGG 362 
Phe Ser Ser lie Tyr Val Lys Thr Phe Trp Pro Val Giy 
10 15 



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE 7YPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 16.772 

(C) IDENTrFtCATTON METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.7 

seq LRFLLRDPGCLLA/QP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 



AAGACGGCGG TGCGC ATG CTC TGT TGC GGT CCG CTT CGG TTT CTG TTG CGG 51 
Met Lau Cys Cys Giy Pro Leu Arg Phe Leu Leu Arg 
-15 -10 

GAC CCG GGG TGT CTC CTA GCG CAA CCG GAA CTA GCC TTC TGG GGG CCC 99 
AsD Pro Giy Cys Leu Leu Ala Gin Pro Glu Leu Aia Phe Trp Giy . Pro 
-5 1 5 
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GCT TCC T-ZZ ATC TCT GGC GGC CTT GTA GTC GTC TCC GAG AC^ C-C ^AC 

Ala Ssr Phe lie Ser Gly Giy Leu Val Val Val Ser Giu Th- ^-o ^^s 

15 20 ' 25 

CCC TCC TTC CCT CTT GAC CCC CCG 

Pro Ser ?h.e Pro Leu Asp Pro Pro 
30 



147 



171 



(2) INFORMATION FOR SEQ ID NO: 144: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 37 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGV: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 



(ix) FEATURE: 

(A) NAME/KEY: sig_peotide 

(B) LOCATION: 360.. 416 

(C) IDENTIFICATION METHOD: Von Hei3ne matrix 

(D) OTHER INFORMATION: score 4.7 

seq ILLRMTVLPTLWT/RR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

AAGAGAGAAA CTTGGCGATC ACGTTTTCAC ATGATGCTCA CGCTCAGGGC GCTTCAATTA 60 

TCCCTCCCCA CAAAGATAGG TGGCGCGTGT TTCAGGGTCt C7CGTCTCTC TCCTACAGAA 120 

AAGAAAAAGA AAAAAATGTC ATTAGAAGAG GCGTAACACG TCAGTCCGTC CCCAGATCGA 180 

GCCTGCGTGC TGCCGAAGCA GGGCGCCGAG TCCATGCGAA CTGCCACCTG ATCCGCTCTT 240 

ATCAATGAAG CAGCCGATCA TG6CGGATGG CCCCCGGTGC AAGAGGCGCA AACAAGCCAA 300 

TCCCAGGAGG AAAAACGTGG TGAACTATGA CAATGTAGTG GACACAGGTT CTGAAACAG 359 

ATG AGG AAG ACA AGC TTC ATA TTG CTG AGG ATG ACG GTA TTG CCA ACC 407 
Met Arg Lys Thr Ser Phe He Leu Leu Arg Met Thr Val Leu Pro Thr 
-15 -10 -5 

CTC TGG ACC AGG AGA CGA GTC CAG CTA GTG 437 
Leu Tro Thr Arg Arg Arg Val Cln Leu Val 
1 5 



{2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHAR.ACTERISTICS : 

(A) LENGTH: 153 base oairs 
(3) TYPE: HUCLEIC ACID 
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,AZ] 3TPANC£DNE3S: DCUBLi 
(D) TOPOLOGY: LINEAR 



(ii) KOLEC-JLE TYPE: CDNA 



(vi) ORIGINAL' SOURCE : 

{A> ORGANISM: Homo Sapiens 
iz) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 31.799 

(C) IDENTIFICATION METHOD: Von Keijne matrix 

(D) OTHER INFORMATION: score 4.6 

seq VRVGLVLVXRALC/LX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 



AGGGAAGGGA GGGCAGGCGG KGCTGGAGTB ATG TGG TGG AAA OCT GCT CCT GAG 54 

Met Trp Trp Lys Pro Ala Pro GIu 
-20 

GAA GGG GTC CGG GTG GGG TTG GTG CTT GTG TSA AGG GCT CTG TGC CTC 102 
Glu Giy Val Arg Vai Giy Leu Val Leu Vai Xaa Arg Ala Leu Cys Leu 
-15 -10 -5 1 

TKT GTA CTC TCT CGG TTC ATG TTC ASA AAT CCT GGC CTT GGT GGC ATG 150 
Xaa Vai Leu Ser Arg Phe Met Phe Xaa Asn Pro Giy Leu Giy Glv Met 
5 10 15 " 

GGG i53 
Giy 



(2) INFORMATION FOR SEQ ID NO: 146: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 54 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 



{ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 3747.415 

(C) IDENTIFICATION METHOD: Von Hei]ne matrix 

(D) OTHER INFORMATION: score 4.6 

seq FNFLLGNSSCVYQ/RP 

(xi) SEQUENCE CZSCRIPTION: SEQ ID KO: 146: 



^^.-.^-rcTc.r.r.c. CAGCTTCTTC CCXAGCTCTA TCCTHCTTCC CTCCATCTCC ".-.TAGGATTC fiO 
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GAGCCCTGCC 


120 


TAAGACTCAT 


GCTACAAGAA GTTAAATAAG 7TTCCCGAAG TCACACAGCT 


A3CCTCTCAT 


180 


CCCTTTTCTA 


CTGAGAGGAA GTGGAATGCA CTCCGACAAG GATAAGGTTT 


TATTGTGAGC 


24 0 


TGGCCTTGGA 


A7TAAACCAC CACC.AACACA CTTTTGGA7T ATCAGNNGGT 


GGAAGGAGTG 


30C 


CAAATGCCAG 


TTACGGTGAT GCGTTCAACA TCCTTATTTC CAGTTCAGA.A 


TTTCCCTGGA 


36C 


GCTCCAAATT 


TTT ATG TTT AAT TTC TTA CTG GGC AAT TCC AGT 
Met Phe Asn Phe Leu Leu Gly Asn Ser Ser 


TGT GTA 
Cys Vai 


409 



-10 -5 

TAT CAA AGG CCC ATC AG A TTA AAA CTC ATT ATC TTC CCA TCA GGG 4 54 

Tyr Gin Arg Pro lie Arg Leu Lys Leu He lie Phe Pro Ser Gly 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 14 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 base pairs . 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
{B) LOCATION: 57.7l82 

CO IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 4.6 

seq LDPAVSLSAPAFA/SA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 



ORAAACCGGA GCCACAGAGG ACAGGGTAGA GTCGCAGAAA GGAGAGACAC ACATAC ATG 59 

Met 

AAA AGA GGA GCT TTC TCC AAT CTT AAT GAT TCC CAG CTC TCA GCC TCG 107 
Lys Arg Giy Ala Phe Ser Asn Leu Asn Aso Ser Gin Leu Ser Ala Ser 
-40 -35 -30 

TTT CTG CAA CCC AGC CTG CAA GCA AAC TGT CCT GCT TTG GAC CCT GCT 155 
Phe Leu Gin Pro Ser Leu Gin Ala Asn Cys Pro Ala Leu Asp Pro Ala 
-25 -20 -15 -10 

GTG TCA CTC TCC GCA CCA GCC TTT GCC TCT GCT CTT CGC TCT ATG AAG 203 
Vai Ser Leu Ser Ala Pro Ala Phe Ala Ser Ala Leu Arg Ser Met Lys 
-5 15 

TCC TCC CAG GCT GCA CGG AAG GAC GAC TTT CTC AGG TCT CTT AGT GAT 251 
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Ser 5er Gin* Ala Ala Arc* Lys Asp Asp Phe Leu Arg Ser Leu Ser Asp 
i2 15 " . 20 

GGA GAC TCA GGG ACA TCA GAA CAC ATC TCA GCG GTG GTG ACT AGC CCT 299 
Gly Asp Ser Giy Thr. Ser Glu His lie Ser Ala Val Val Tnr Ser Pro 
25 3C 35 

CGG ATT TCC TGC CAT GGT GCT GCC ATT CCC AMM GCM MGT GCC CWC TGK 34 7 
Arg lie Ser Cys His Gly Ala Ala He Pro Xaa Aia Xaa Ala Xaa Xaa 
40 45 50 55 

MTA GGC TGT TCC TGC TGM ACC GAA CGM MTC CTC MTG MCA CCG CCC TCC 395 
Xaa Gly Cys Ser Cys Xaa Thr Glu Arg Xaa Leu Xaa Xaa Pro Pro Ser 
60 65 70 

CTC CTT TCT TTA GAA GCC 413 
Leu Leu Ser Leu Glu Ala 
75 



(2) INFORMATION FOR SEQ ID NO: 148: 

(il SEQUENCE CHAPACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANOEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 32-7l03 

(C) IDENTIFICATION METHOD: Voa Heijne matrix 

(D) OTHER INFORMATION: score 4.G 

seq FFIFCSLNTLLLG/GV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 



A.'^CAACTATC CTGCCTCCTC CTTGCTGCAC C ATG AAG TCT GCC AAG CTG GGA 52 

Met Lys Ser Ala Lys Leu Gly 
-20 

TTT CTT CTA AGA TTC TTC ATC TTC TGC TCA TTG AAT ACC CTG TTA TTG 100 
Phe Leu Leu Arg Phe Phe He Phe Cys Ser Leu Asn Thr Leu Leu Leu 
-15 -10 -5 

GGT GGT GTT AAT AAA ATT GCG GAG AAG ATA TGT GGA GAC CTC AAA GAT 148 
Gly Gly Val Asa Lys He Ala Glu Lys He Cys Gly Asp Leu Lys Asp 
15 10 15 

CCC TGC AA.^; TTG GAC ATG AAT TTT GGA AGC TGC TAT GAA GTT CAC TTT 196 
Pro Cys Lys Leu Asp Ket Asn Phe Gly Ser Cys Tyr Glu Val Kis Phe 
20 25 30 
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AGA 7AT TTC-TAC A.AC AGA ACC TCC AGA TGT GAA ACT TTT GTC TTC 

Arg Tyr Phe Tyr Asn Arg Thr Ser Lys Arg Cys Giu Thr ?he Vai Fas 
35 4C ' 45 



244 



TCC AGC TGT AAT GGC AAC CTT AAC GGG 
Ser Ser Cys Asn Giv. Asn Leu Asn Gly 
50 55 



27: 



(2) INFORMATION FOR ScQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

IB) TYPE: NUCLEIC ACID 

(C) STRAN'DEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 31.. 75 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6 

seq ILFPLHSVIGSHP/QC 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 



AAGTTGCCTG AGGACAGCAG TSCAGTTGAC ATG GAT ATT CTC TTT CCT CTG CAC 54 

Met Asp lie Leu Phe Pro Leu His 
-15 -10 

AGT GTT ATT GGG AGC CAT CCT CAG TGC CTC CCA GAG AGG WGG ACA GCG 102 
Scr Val lie Gly Ser His Pro Gin Cys Leii Pro Glu Arg Xaa Thr Ala 
-5 15 

AGA ATG ATC AAG CTG AAG TGG GGG AAT GGC TCA GGA TCC CAT TTC GGG 150 
Arg Met lie Lys Leu Lys Trp Gly Asn Gly Ser Giy Ser Asp Phe Gly 
10 15 20 25 



(2) INFORMATION FOR SEQ ID NO: 150: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 30 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) 3TRANDEDNESS: DOUBLE 
:D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

;A) CRGAMIStM: Homo Sapiens 
(F) TISSUE TYPE: Testis 
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(ix) FEATURE: 

(A) NAME/KEY: sig_pepcide 

(B) LOCATION: 275.. 355 

(C) IDENTIFICATION METHOD: Von Heijne matri.x 

(D) OTHER INFORMATION: score 4.6 

seq C3I1ILL50RQWS/KN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15C: 



GTTGGAAAAC 


AGTTTTGGCT 


CTGAGGACCC 


AGCAGTTGAC AAACAGGAGG CCTGGGACAA 


60 


GAGCAGTATG 


AGAAGTCAGA 


TCGCCTCTTT 


TAATGTCACT AGTCAGTACA GGCCTCGCCA 


120 


GAC/^J\GTCTC 


TCCTCARMNT 


CACTTGGAAG 


AACAGGCC3D CTCTTCATGA TCCTGGGTTT 


180 


CCTAGACWTA 


TTTCCAGGAC 


TGTTATGGGG 


ATTAGGGCCA ACTGTAAAAG TGGCTGAGGA 


240 


G.2-.CTAGGTAA 


AGAGTGTTGT 


CTCACTTTAG 


AACA ATG CTG AAG GTG TTT AGA GCC 
Met Leu Lys Val ?he Arg Ala 


295 



-25 

TGM CAT OCT AAA ATA TGC CAC TTT GGC ATA CTG ATT CTT CTG AGC CAG 343 

Xaa His Pro Lys lie Cys His Phe Giy lie Leu lie Leu Leu Ser Gin 
-20 -15 -10 .5 

AGG CAA TGG AGC AAA AAC AGA TGC AGG GAA GGC TGT CTG ACC ACC CTC 391 
Arg Gin Trp Ser Lys Asn Arg Cys Arg Glu Giy Cys Leu Thr Thr Leu 
1 5 10 

TTT CTG TTT GAA GCG GAA CAT AAA AGT TCC CTT GTG AAA 430 
Phe Leu Phe Glu Ala Glu His Lys Ser Ser Leu Val Lys 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base oairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

{VI) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig_pepticie 

(B) LOCATION: 219.. 320 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6 

seq LXWRKL.WSWTLS/QE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
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ACAATTAAAC CACACAGAAA ATGATGTGAC TCATCTTCAA AAGGAAATGA GCAATTGTAG 60 

AGCAGGTGAA A.«^CGCTGGCA TGGGTAGGTT CACTAAGGTG GGTGAGCAAG A;i-AGGACAGT 120 

GGACACCCTG CCGTCCCCCC AGCACCCCGT GGCTCATTGC TGCAGTCAGC TGGAGGAGAG 130 

GTGGCAGAGG TTGCAGAGCC AGG7CATCTC GGAGCTGG ATG CTT GTA AGG MT GCA 2 36 

Met Leu Vai Arq Asn Ala 
-30 

CGC AGG GGG TCC AGA GGG AGG TCT CCA TGG TGG AGG GCA GGG 7GT CTM 23 4 
Arg Arc Giy Ssr Arg Giy Arg Ser Pro Trp Trp Arg Aia Gly Cys Leu 
-25 -20 ' -15 

RTA TGG AGA AAA CTT GCA GCA AGC TGG ACT CTA TCT CAG GAA ATC TTC 332 
Xaa Trp Arg Lys Leu Aia Ala Ser Trp Thr Leu Ser Gin Glu lie Phe 
-10 -5 1 

AGA GGA TCA AGG AAG GGC TCG 35 3 

Arg Giy Ser Arg Lys Giy Ser 
5 iO 



(2) INFORMATION cOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE; Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(a? LOCATION: 61.. 147 

(C; IDENTIFICATION METHOD: Von Heijne matri:< 
(Dl OTHER INFORMATION: score 4.5 

seq FTLGLGYPIPTRL/QP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 



GATGGCGGCG ASGSGGACGG TSAAGGTTGC CT< 

ATG ACT .fi-AA G'JG CAT CAC CAC CAG CAT 

Met Thr Lys Giv His His His Gin His 
-25 

TTC ACC CTG GGC TTG GGA TAC CCC ATA 

Phe Thr Leu y^j Leu Giy Tyr Pro lie 
-1- -5 

ACA TTA AG? T::A GAC CCC CTT CTG GAC 

?hr I^su 3s r Zitr Asp ?rc Leu Leu Asp 
S ' 10 



CCGCCCG TCCGGGCTCT GATCCTCCCC 60 

CCC CTG CAT CCC CAC CCA CTC 103 
Pro Leu His Pro His Pro Leu 
-20 -15 

CCC ACT CGC CTG CAA CCA TGC 155 
Pro Thr Arg Leu Gin Pro Cys 



ATT ACC TGT TCC CTG AGA AGC 20; 
lie Thr Cys Ser Leu Arg Ser 
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CwA AGC 7CT GGG 2'6 
Pro Ser Ser Gly 
20 



INFOEOIATIOM cOR 3EQ ID MO: 153: 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 236 base pairs 

(3) TYPE: NUCLEIC ACID 

(CJ STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Spleen 

iix) FEATURE: 

(A) NAME/KEY: sig_peccide 
(3) LOCATION: 162.. 2 30 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.5 

seq RLHILFIVCLARG/KG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 



AGCTGCTTAG TTTGCTAATT CTAGTGGTTC AAACCAGATT TCAAAATCTG GGCTAAATCT 60 

CTGTCATGCT ATGACATGGC ATTTGACAGT AATTCCTGAA TATTTAATTG ATAGAAAAAC 120 

AGAAAGCATG CAT.ATTGTTT AGTACAATTG TGTGAACTGC T ATG ACA TAT CAT KRC 176 

Met Thr Tyr His Xaa 
-20 

ATA CAG TTT TCT GAA AGA CTG CAT ATT TTA TTC ATT GTA TGC CTA GCA 224 
lie Gin Phe Ser Glu Arg Leu His lie Leu Phe lie Val Cys Leu Aia 
-15 -10 -5 

CGG GGA AA.~. GGG 236 
Arg Gly Lys Gly 



(21 INFOKC^TION. FOR SEQ ID NO: 154: 

(i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 230 base pairs 
U3) TYPE: NUCLEIC ACID 

STPuAiNDEDNESS: DOUBLE 
fO) TOPOLOGY: LINEAR 

(ii' :o:.i:CULE type: CDNA 

(vi' '\;MGIKAL SOURCE: 

ORGANISM: Homo Sap ions 
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(:) rrSSUE TYPE: Ovary 

iix) FEATURE : 

(A; NAME/KEY: sig peptide 
(5) LCCATION: 9..T4 6 

(C) ICEMTIFICATIOM METHOD: Vcn Heijne natrL>: 

(D) OTHER INFORMATIOM: score 4.5 

seq LIYCGLSQPLTLG/VT 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 



ATTG7ATC ATG TCT CAA TTT OCT CTC TGC AGC CCT CCG TGG P^AA CCA CTT 50 
Met Ser Gin Phe Pro Leu Cys Ser Pro Pro ?rp Lys Pro Leu 
-45 -40 * -35 

GTC AAG GTC TCC AGA AAC CTG AAA ATA AGG ATG TCC ATT CCA TGG CCA 98 
Vai Lys Vai Ser Arg Asn Leu Lys lie Arg Met Ser lie Pro Trp Pro 
-30 -25 -20 

CTC TCA GTC CTG ATT TAC TGT GGT CTC TCG CAG CCT TTG ACC CTG GGG 14 6 
Leu Ser Vai Leu lie Tyr Cys Gly Leu Ser Gin Pro Leu Thr Leu Giy 
-15 -10 -5 

GTC ACC TCT CCT TCC TTC CCC CAA AAC TCT TTC TTC CCT TGG CTT CCA 194 
Vai Thr Ser Pro Ser Phe Pro Gin Asn Ser Phe Phe Pro Trp Leu Pro 
15 10 15 

GAA CAC CCC ACT CAC CTG GTC TCC TCT ACC CCA CAG 230 
Glu His Pro Thr His Leu Vai Ser Ser Thr Pro Gin 
20 25 



(2} INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 base pairs 

{H) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi; ORIGINAL SOURCE: 

(A: ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ixl FEATURE: 

(A) NAME/iCEY: sig^peptide 
(3) LOCATION: 26,.ioO 

(C) IDENTIFICATION METHOD: Von Keijne matrix 
■D) OTHER INFORMATION: score 4.4 

sec AMGFLLMFDLTSQ/QS 

. 3r.Q'J£MCE DESCRIPTIOM: SEQ ID NO: 155: 



AAAAGGACAT rTrCTTCTGC AATCC ATG TTC CGG AGT CTC ACC ACT GCA TTT 52 

Met Phe Arg Ser Leu Thr Thr Ala Pno 
-25 -20 
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TTC AGA GA: GCC ATG GGC TTC TTA TTA ATG TTT GAC CTC ACC AGT CPA IOC 
Phe Arg As? Ala t<ec Gly Phe Leu Leu Met Phe Asp Le«i Thr S^r G\r. 
-15 -10 -5 

CAG AGC TTC TTA A.AT GTC AGA PJKC TGG ATG AGC CAA CTG CAA GCA PJ^r 148 
Gin Ser P.ie Leu Asn Val Arg Asr. Trp Met Ser Gin Leu Gin Aia Asn 
1 5 * 10 15 

GCT TAT TGT GAA AAT CCA GAT ATA GTA TTA ATT GGC AAC AAG GCA GAC 196 
Ala Tyr Cys Glu Asn Pro Asp lie Val Leu lie Gly Asn Lys Aia Asp 
20 25 30 

CTA CCA GAT CAG AGG GM GTC AAT GAA CGG CAA GCT CGG GAA CTG GCT 24 4 
Leu Pro Asp Gin Arg Glu Val Asn Giu Arg Gin Ala Arg Giu Leu Aia 
3*5 40 45 

GAC AAA TAT GGC ATA CCA TAT TTT GAA ACA AGT GCA GCA ACT GGA CAG 292 
Asp Lys Tyr Gly lie Pro Tyr Phe Giu Thr Ser Ala Ala Thr Giy Gin 
50 55 60 

AAT GTG GAG AAA GCT GTA GAA ACC CTT TTG GAC TTA ATC ATG NRG CGA 34 0 
Asn Val Giu Lys Aia Val Giu Thr Leu Leu Asp Leu lie Met Xaa Arq 
65 70 75 80 

ATG GAA CAG TGT GTG GAG AAG ACA CAA ATC CCT GAT ACT GTC AAT GGT 388 
Met Glu Gin Cys Val Glu Lys Thr Gin lie Pro Asp Thr Val Asn Giy 
35 90 95 

GGA AAT TCT GGA AAC TTG GAT GGG GAA AGC CAC CAG AGA AGA AAT GTA 436 
Gly Asn Ser Giy Asn Leu Asp Giy Glu Ser His Gin Arg Arg Asn Val 
100 105 110 

TCT GCT AGA 44 5 

Ser Ala Arg 

115 ■ 



[2) INFORMATION FOR SEQ ID NO: 156: 

U: SiOyENCE characteristics: 

(A) LENGTH: 319 base pairs 

(3) TYPI: NUCLEIC ACID 

iC) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECOLS TYPE: CDNA 

ivL) ORIGIMAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(i::? TEATURE: 

;;-.) NA-VE/KEY: sig peptide 
:3) LOCATION: 1357.295 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
5 OTHIR IMcORMATION: score 4.4 

seq LSYAS3ALSPCLX/A? 

.:rQf;=:::CI description: SEQ ID NO: 156: 
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ATCACCTTC7 TCTCCATCCT TSTCTGGGC3 A37CCCCARC CCAGTCCCTC TCCTGACCTG 60 

CCCAGCCCA.A GTCAGCC7TC AGCACGCGCT TTTCTGCACA CAGATATTCC AGGCCTACCT 12C 

GGCATTCCAG GACCTCCGMA ATGATGC7CC AGTCCCTTAC AAGCGCTTCC TGGA7GAGGG 18C 

TGGC ATG 37G CTG ACC ACC C7C CCC TTG CCC TCT GCC AAC AGC CC7 G7G 229 
Met vai Leu Thr Thr Leu Pro Leu Pro Ser Ala Asn Set Pro Vai 
-35 -30 -25 

AAC ATG CCC ACC ACT GGC CCC AAC AGC CTG ACT TAT GC7 AGC TCT GCC 2 77 
Asn Met Pro Thr Thr Giy Pro Asn Ser Leu Ser Tyr Ala Ser Ser Ala 
-20 -15 -10 

CTG TCC CCC TGT CTG AHC GCT CCA AAG TCC CCC CGA CT7 GGG 319 
Leu Ser Pro Cys Leu Xaa Ala Pro Lys Ser Pro Arg Leu Giy 
-5 1 5 



(2) INFORMATION TOR S£Q ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESi) : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Honio Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 106.. 195 

(C) IDENTIFICATION METHOD: Von Heijiie cnatrix 

(D) OTHER INFORMATION: score <.3 

seq L1PTLPWLP3TRL/LS 

(>:i: SEQUENCE DESCRIPTION: SEQ ID NO: 157: 



AGCACAGCGC TGRRATGCCA GGTTCGGGTA GGAGGCCCCT TGGGGGRMNR ATTCTTTAGG 60 

AAATTCCTT7 AGAAGVAAAC AACTTGGGAC TGGATAGCGT GCGAT ATG GAG AGA AAT 117 

Met Gin Arg Asn 
-30 

GCA ACT 77C ATT CAT TTG CAG TTA GCG ATC CGC CCT TCC CTG CTC CCC 165 
Ala Thr ?r.^ lie His Leu Gin Leu Ala lie Arg Pro Ser Leu Leu Pro 
-25 -20 -15 

ACC CT7 Z:':. TGo CTC CCC AGT ACC CGC CTG CTG TCG CCC ACA CCC TTA 213 
Tr.r L-3'J t-.z Tro Leu Pro Ser Tr.c Arg Lsu Leu Ser Pro Thr Pro Leu 
-iO -5 L 5 

GGA CAG 7TT CGT GGC CCC CCG GGA CCG CAG AGG GCC ATG CCT ACC GCT 261 
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Giy Gin L;S- Ara Gly ?rc Pro Gly Xaa Gin Arg Aia Met Pro T-^r A" a 
1^- 15 2D 

CAT TTA AGA 27C 
His Leu Arg 

25 



(2) INFOSHATIDN FOR SEQ ID KO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

:yi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) TEAT'JRE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 50.. 94 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHEH INFORMATION: score 4.3 

seq ILFCFHSFHPLFQ/DT 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



.ACATATATCT ATCCTGACAA TATTTAGCAG TTCAAAAGGT AATAAGATT ATG AAT ATA 58 

Met Asn lie 
-15 

TTA TT7 TGC TTT CAT TCT TTT CAC CCT CTA TTT CAA GAC ACT ATC G.AA 106 
Leu Phe Cys ?he His Ser Phe His Pro Leu Phe Gin Asp T.^r lie Giu 
-10 -5 1 

TTT ' 109 
Phe 
5 



(2) INFORiyiATION FOR SEQ ID NO: 159: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs 
(3) TYPE: NUCLEIC ACID 
;C) STPANDEDNESS: DOUBLE 
:D) TOPOLOGY: LINEAR 

'ii; n:OLECULE TYPE: CDNA 

ORIGINAL SOURCE: 
;A) ORGANISM: Homo Sapiens 
:T) TISSUE TYPE: Spleen 
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:ixr FEATURE: 

; ) N Af'iS / KI V : s 1 c_pe p t i de 
(^) r.OCATIONJ: }^H. '.?'-.! 

(C) IDENTIFICATION MITHOD: Von Hei-)ne natrix 
O) OTHER INFORMATICN: score 4.3 

seq FNFLFLVC'-CT LA/CD 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



AAGATAAATT GGGGAATTC? AGGGAAACCC TTGAATACCA AGATAGAAAA. CTA.AAGTTTT 50 

TACTTCATTT GGTCATGGGA AACTTGCACT GAGCATGGGA GTCA-ATA-ATT AGA-AGCAAGT 120 

KAAATTCAA-A AAGTCGAACC CCATTCATAA AACCAGCTGA TAGTCTGAAA ATACGCTTTG 190 

AGCT.^AGCAA AGAATAC ATG TTG ACA AAT CGT AAC TAG TTT AAC TTC CTT 230 
Met Leu Thr Asn Arg Asn Tyr Phe Asn Ph.e Leu 
-20 -15 -10 

TTT CTT GTA CAA TTG TGC ATC CTG GCT TGT GAC AAT GCA TAC CTT CAG 278 
Pne Leu Val Gin Leu Cys lie Leu Ala Cys Asd Asn Ala Tvr Leu Gin 
-5 1 * 

TCG TGT CCC CTC ACC TCA AAG ACT CCT CTG TTA CAA ACC CAC TCT GCT 325 
Ser Cys Pro Leu Tnr Ser Lys Thr Pro Leu Leu Gin Thr His Ser Ala 
IC 15 20 

CTT TTC TAT A.AT AGT ACA TAT GGG ATT TTC CTA CTC CTA GGA GTG 371 
Leu ?he Tyr Asn Ser Thr Tyr Gly lie ?he Leu Leu Leu Gly Val 
25 30 35 



(2) INFORMATION FOR SEQ ID NO: 160: 

(1) SEQUENCE CHARACTERISTICS: 

:a; LENGTH: 363 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) 3TRANDEDNESS: DOUBLE 
(Dl TOPOLOGY: LINEAR 

.^1) MOLECULE TYPE: CDNA 

;vi.) ORIGINAL SOURCE: 

(A} ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

{L:<) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 1907.267 

(C: IDENTIFICATION METHOD: Von Heijiie matrix 

(D) OTHER INFORMATION: score 4.3 

seq ALCRFVGMQPCTA/QT 

3E:Q'oENCE DESCRIPTION: SEQ ID NO: 160: 



\.ATT::-.;.-.-.C .-.GTTGCATTG GAATGTAAGG TCAGGGCACC ACTGAGTTCA GTACTTCA.=iA 60 

Wilzz: :: tctacctctc cccagtgcac aaaaacactc tccacaccaa gctgctgctg 12o 
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CTGGGAT^ivj.-. GGGATGGCGT CASGATTCAA GACTGTTTTT CCTACCTGTT CAGCACTTCT 130 

TT^AGCGAr ATG AAG TTA AAT CCA GGC CA.H GTT CCC ACC TGG TGG GAA GCA 2J1 
Met Lys Xeu Asn Pro Gly Gin Vai Pro Thr Trp Trp Glu Ala 
-25 ' -2G -15 

CTG T3C AGG TTC GTG GGG ATG CAG CCC TGC AC A GCC CAG ACT GGA CTC 279 
Leu Cvs Arc Phs Val Glv Mec Gla Pro Cys Thr Ala Gin Thr Gly Leu 
-10 -5 1 

CTT CCC CAT GGA ACT CAC AAC ACA CGG GAG AGG CAG AGA GAT CCA AGC 327 
Leu Fro His Gly Thr His Asn Thr Arg Glu Arg Gin Arg Asp Pro Ser 
5 * 10 15 * 20 

GCA CAG AAA A-AC ACA AGA AGA TTC AGC CCT GTT GGG 363 
Ala Gin Lys Asn Thr Arg Arg Phe Ser Pro Val Glv 
25 30 



(2) IMFORMATIOM FOR SEQ ID NO: 161: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 base pairs 
(S) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

{ix! FEATURE: 

lA) NAME/KEY: sig__peptide 
■3) LOCATION: 97. .177 

(C) IDENTIFICATION METHOD: Vcn H-ijns matrix 
CD) OTHER INFORMATION: score 4.3 

seq 1CLNLCPC3SSLL/SP 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 



ACTCTTGTAG TGGMGCCGGC TTGCATCCCA GGTCGTGGCG GTTTTGGTGC CTGAAGCAGG 60 

GAGCGCGGAG TCGTTCCCGA GAGAGGCGGC CAGGCT ATG CTC GCC GGT TTC CGG 114 

My- Leu Aid Gly Axy 
-25 

CGT TCC GCT CCG GCC AGC CAG AGT CTC TGT CTC .AP-.C CTG TGT CCG TGC 162 
Arg Ser Ala Pro Ala Ser Gin Ser Leu Cvs Le'j Asn Leu Cys Pro Cys 
-20 -15 ' -LO 



TCC kZZ A3T CTC CTC AGC CCG GCG 
S-si S-jr. 3-*r Leu Leu Ser Pro Ala 



186 
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(2; INT0EU--:ATX0N for 3L0 ID NC: 162: 

;i) SiOUENCE CH.^UU^.CTERISTICS: 

(A;' LENGTH: 311 base pairs 
(D) TYPE: NUCLEIC ACID 
(C; STRANDE0NES3: DOUBLE 
iO) TOPOLOGY: LINEAR 

iii) XOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hoir.o Sapiens 
{?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 237/.290 

{C) IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 4.3 

seq 3FYLLFFLNDVPP/CP 

(xil SEOCJENCE DESCRIPTION: SEQ ID MO: 162: 



ACAGRACAGG TTAAAGAGAT AATCATTTGG GACTCAAATG TCTCTCCCCC CGGGCACTTG 60 

CATATGGGAC ATTGAGTCCT TTTGTTTTCC CTTGATCTAT AGCTCTTACC CCTCTGCCCA 120 

GTAATTCCCT GAGGAAGAGG TAAAGATCAR AGTTGRTACT TTGTCCTTTC CTTCCKTCTT 180 

CCCTTATTTT TAAAGCTGTC RSCCACACTG ATTCCTGCTC TAATAGCAGA GCAGAG ATG 239 

Met 

AAG GA.A GGA GCT TCC TTC TAT CTG CTT TTC TTT CTC AAT GAT GTC CCA 287 
Lys Glu Giy Ala Ser Phe Tyr Leu Leu ?he Phe Leu Asn Aso Vai Pro 
-■5 -10 -5 

CCA TG7 CCC CCT CAC ACC CCC GGG 311 
Pro Cys Pro Pro His Thr Pre Giy 
1 5 



(2) IMFORMATION FOR SEQ ID NO: 163: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) 3TRANDEDNESS : DOUBLE 
(0) TOPOLOGY: LINEAR 



;iij MOLECULE TYPE: CDNA 



•.vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 



ti:;; FEATURE: 

(A) NAME/KEY: sig_pepticie 
(3) LOCATION: 30 5.. 3 91 

(C) IDE\'TIFICATIOM METHOD: Vor. Hci^ne siatrix 
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:C) OTHER INTORMATION: score 4.S 

seq ETLLLKL3SC3RT/NR 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 



ATATTTCA3C TTCCACATTT TATTTCAACA ACATTAGTCA TCGTAGCTGC GTATTCCT3T 60 

THTCAGTGTA GTAACGTTGA GCAHTTATGT TCCTAGCACT CTTCCAGGTA CCOTCTGCGT 120 

TATCAGGCAG GCACATCTCT CCTGAAAGAA TTTATATTCT TGTCAGGGA.A AT-SlAGGCTTC 13C 

AGATA.A3A-AA AAATTCGGGG GAAAGTGCCT AATTCCTTCT ACCCTAACCT GCCTCCATTT 24C 

CCTCCCTCCT CCGAGTTGAG ATGATTGGGT CAGAGCCAGC TCTTCCTGGG CTTGGGAA.GA 300 

GGAG ATG GGG CTT GAG TGC TGC TGC CCC CCT CAT AAC CTC AGA GTC TAT 34 9 
Met Giy Leu Glu.Cys Cys Cys Pro Pro His Asn Leu Arg Val Tvr 
-25 -20 -15 

ATT GAG ACT CTC TTG CTC AAA CTC TCC TCG CAG AGT AGA ACG AAC AGG 397 
He Giu Tnr Leu Leu Leu Lys Leu Ser Ser Gin Ser Arg Thr Asn Arg 
-10 -5 1 

CTG 400 
Leu 



(2) INrORMATION TOR SEQ ID NO: 154: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

ilK) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 2757.337 

(C) IDENTinCATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4,2 

seq VLSIAASLLQCRL/AV 

(xi; SEQUENCE DESCRIPTION: SEQ ID NO: 164: 



.-•-'.tctgaaac agttctagtc tcaagcattt tggatgaggg atacccatcc tctatttta? 60 

':.-.ccattt7c atgc7gtatt aaaatgaaat tgccaactca gttcaaagga atttttcttc 120 

t7.\gc7ttac attgttgatt catggtggac ccgaacaaac tatcgac7gg tgggttggat 100 

.-.::t':3gtc: agagaggtcc ttgtgacata tctcatggcc cattacctag gtgatctgag 2ao 
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LCTT3CCTTC. TGCCTGCAAT AAA3TTT7GT TGGA ATG CAG CTA TCC CCA TTT ACT 2?5 

Met Gin Leu Cys frs Phe Thr 
-20 -ir 



AG? GTA TTG TCC ATA GCT GC7 TCT CTG 
Ser Val Leu Ser He Aia Ala Set Leu 



CTA CA.A TCT AGA TTA CCA GTT 3-13 
Le-j Gin Cys Arg Leu Ala Val 

1 



GTA ACA GAG ACT ATA TGG CCC CCC CAG VNT TGG 
Val Thr Glu Tnr lie Trp Pro Pro Gin Xaa Tro 
3 10 



375 



:2) IMFORMATIOM FOR SEQ ID NO: 165: 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A: ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(iz) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 1397.270 

(C) IDENTIFICATION METHOD: Von Heijne macn.x 

(D) OTHER INFORMATION: score 4.2 

seq QLLFKLNSTWCRA/LQ 

(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 



A.:CCCYA3.-A GTTATAAGGA AASGCCTTCC AACTTGATAC AGTTGCTTTT CTTTCCTGAA 60 

TrCCCTGTTT ACTGGAAATT TCATTGGATT TTGGGAGGAG AGAGGTCTGA AGG.AAGGAAA 120 

GGCCTGTTTT CTGCTGTA ATG GAT GTA ACA TGC TGC TTT GAT GCA GTT GAA 171 
Met Asp Vai Thr Cys Cys Phe Asp Aia Vai Glu 
-40 -35 

GCA AGT GAC TTC AGG GTT TGC TGT CAT GGA TGC GTG TCT TGG CTG TGT 219 
:iy S-3r Asp Phe Arg Val Cys Cys His Giv Cvs Val Ser Trp Leu Cys 
-30 -25 * ' -20 

::C CAG ATG CTG CAG CTT TTA TTC AAG CTT A.AT AGC ACT TGG TGC AGA 267 
l-j.i Zip. Met Leu Gin Leu Leu Phe Lys lau Asn Ser Thr Trp Cys Arg 
-15 -10 -5 

:':a ctc cag agt gaa acc tca ttg gct tcc cgg cgc ctg tgg atg tgg 3 is 
Lej :-la Ser Giu Thr Ser Leu Aia Sar Arg Arg Leu Trp Me^ Trp 
: 5 10 15 

TTT :at ctg ayg gag ttc ttt act ctg acc ccc tgg 354 

■ o-rr Hi 3 Leu Xaa Giu Phe Phe Thr Val Tp.r Pro Trp 
20 25 ■ ' 
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(2; INFORM^iTION TOR SEQ ID NC: 156: 

{i) SEQUENCE CHARi.CTERISTICS: 

{A) LENGTH: 84 base pairs 
(3) TYPE: MUCLEIC ACID 

iZ] STR.AKDEDNE3S: COUBLE 
(D) TOPOLOGY: LINEAR 

iii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 7. .72 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFOP^MATION: score 4.2 

seq HCFCFTLFSYSSS/FF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 



AAGGAC ATG AGA CAA GGA CCT GGG GCC CCA CTC CAT TGC TTC TGT TTC 4 8 

Met Arg Gin Giy Pro Gly Ala Pro Leu His Cys Phe Cys Phe 
-20 -15 -10 

ACC CTT TTT TCC TAC TCC TCC TCC TTT TTT TTT TTT 84 
Thr Leu Phe Ser Tyr Ser Ser Ser Phe Phe Phe Phe 
-5 1 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

{ii) MOLECULE TYPE; CONA 

•;vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 72.. 116 

(C) ID£NTI^IC.^TION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.2 

seq ITLLGIWLTXRLQ/FP 

SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
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ACAAGCCCCO GGCTTGCTCA TTTCATCCAG GTGAGGAGTC TGGAGTAGAG CAGGGCTTCT 60 

G.^^TGGTGA C ATG GAG ATC ACT CTC CTG 3GC ATC TGG TTA ACA KGC AGG lid 
Met His lie Thr Leu Leu Giy lie Trp Leu Thr Kaa Arg 
-15 . -10 -5 

CTC CAG TTC CCC AGG TCT GGG CGG GCT GGG 140 
Leu Gin ?he Pro Arg Ser Giy Arc Ala Giy 
1 5 



(2) INF0P>L2^TI0N FOR SSQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 base pairs 

{B) TYPE: NUCLEIC ACID 

{C) STRANDEDNESS: DCU3LE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) . NAME/KEY: sig_peptide 

(B) LOCATION: 24 5.. 295 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.2 

seq SWVCLLSAGTAFE/DY 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 



ATTTAGGATT TTAGACTTTA GGGATTTTGA TCTTTGGGGA TTTCAATATT TGGGATTATG 60 

GTATTTGAGA TGGTCTCTTT TAGGATTATG ATCC.AAACCC ATCTCAGGAA TGTGTGAAAT 120 

TTACAGTAGT CCATCCCCAT CCCGGGCTGT AGA-:i-A':GTAG GACCCACAAG CCTTCGTTAC 130 

AGAGCCACTT ACTGCCCCAT GGAGTTCCCA GGTAGATGAC AGTAGCGGGG AGGATACATG 24 0 

GCAC ATG TTA TAT GGC TCT TGG GTG TGC C7T CTC TCA GCA GGC ACT GCC 239 
Met Leu Tyr Giy Ser Trp Val Cys Leu Leu Ser Ala Giy Thr Ala 
-15 -10 -5 

rr? GAA GAT TAT CAT TTG GGG GGT ACG 31G 
?he Glu Asp Tyr His Leu Giy Giy Thr 
1 5 



;21 I>3F0Rv^T:0M for seq id NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) STRANDECNES3: DOUBLE 
iOi TOPOLOGY: LINEAR 
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111) MOLECULI TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(i::) FEATURE: 

(A) NAME /KEY: siq_peptid9 
{B) LOCATION: 59. .154 

(C) IDENTIFICATION t>!ETHOD: Von Hei^ne mairix 
:D) OTHER INFORMATION: score 4.2 

seq XXXXFLLGRRVVG/ES 

(>:il SEQUENCE DESCRIPTIOH: SEQ ID NO: 159: 



ACTATTTTTC CCTTCATTGT C7TTACTTTG CTTCAGAAGA ATAGTCTGTG ATGACGCC 53 

ATG T?A ?TT TTT CCC CTT CTT TC? T?C CGA TTT CTA CCC TCA GAG AGT 105 
Met Leu ?he Phe Pro Leu Leu Ser Phe Arg Fhe Leu Pro Ser Giu Ser 
-30 -25 -20 

TTG TTG AAA GKC 3TA WTG SYT TTT TTG CTG GGG AGG AGG GTA GTA GGA 154 
Leu Leu Lys Xaa Xaa Xaa Xaa Phe Leu Leu Gly Arg Arg Val Val Gly 
-15 -10 -5 

GAA TCA CNT TTT ATT TTC ACA TGT GGA AAT TTG CTT TTA ATT TGG CCT 202 
Glu Ser Xaa Phe lie Phe Thr Cys Gly Asn Leu Leu Leu lie Trp Pro 
15 10 15 

TAG GGG 208 
Tyr Gly 



(2) INFORMATION FOR SEQ ID NO: 170: 

Ui SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRA^DEDNESS : DOUBLE 
(0) TOPOLOGY: LINEAR 

(Li) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

<AJ NAME/KEY: sig^peptide 
(3) LOCATION: 113.. 160 

(C) IDENTIFICATION METHOD: Von Heijne natrix 

(D) OTHER INFORMATION: score A. 2 

seq WAILGCWG?LSRG/HL 

;:;i) SEQUENCE DESCRirTICN: SEQ 10 NO: 170: 



wo 99/06549 



127 



PCT/IB98/01231 



GAGTTGAAAC A.^\AA.ViC7A CATGGAGGTC GA.nCCTGCCA GCCCAG7GGT GG ATG CCA 113. 

Hrit Pro 

GTC TGG GCC ATA CTG GGC TGC TGG GGC ACA CTC AGC AGG GGA CAT CTG 156 
Val Trp Ala lie Leu Gly Cys Tro Glv Thr Leu Ser Arg Giy Kis Leu 
-10 -5 1 

CCT GTG TCC TT3 GAC CCA AAG 137 
Pro Val Ser Leu Asp Pro Lys 



(2) INrORMJlTION FOR SEQ ID NO: 171: 

(i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Komo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 134.. 247 

fC) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.1 

seq GILCGSLPGPSLC/PP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 



ACAAGTTCAC CTGGCCTCCT CTTCTCCAGC CTCAGTCACC TTCTGCTGAA CAGCTCCACC 50 

TTGGCCTTGC TTACTCACAG ACTAAGCCAG ATGACCTGCC 7GCAGAGCCT CAGGTGAGTG 120 

ACCGAGCGGC CCC ATG GGA ATG ACT GGG A-AG AAA CAC TTC CCA CTC AGT 159 
Met Gly Met Ser Gly Lys Lys His Phe Pro Leu Ser 
-35 -30 

T3G GAC CAC ATC CAG GGA AGC ACT GAG GCC ACC TCC CAG GGG ATC CTT 217 
Trp Asp His lie Gin Giy Ser Thr Glu Ala Thr Ser Glii Gly lie Leu 
-25 -20 -15 

TGC GGA TCC CTC CCA GGC CCA TCC CTG TGC CCT CCG 25 3 

Cys Gly Ser Leu Pro Giy Pro Ser Leu Cys Pro Pro 
-iO -5 1 



:2] INrOR:<.ATION for SEQ ID NO: 172: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 base oairs 
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.- -a) ty?h:; nucleic acid 

•C) STRANDEDNES3 : DOUBLE 
iC) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: CDNA 

: VI ) ORIGINAL SOURCE: 

{A} ORGANISM: Hcmo Sapiens 
{r\ TISSUE TYPE: Testis 

(i:v) FEATURE: 

(A: MAME/KEY: sic_pe?tide 
(3) LOCATION: 141.. 251 

(C) IDEMTiriCATION METHOD: Von Heijne matrix 
{D; other INFORMATION: score 4 

seq PLSLDCGHSLCRA/CI 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



AACACCCACC CTGGCTTTTC TTCACCTCTT CA.ACCAGGAG CCGAGATTTC TGTTGCTCTG 60 

A.AGCCATCCA GGGGTCTTTA ACCAGAAGAG AGAGGAGAGC CTCAGGAGTT AGGACCAGA.^. 120 

GAAGCCAGGG .A.^.KCAGTGCA ATG GCT TCA AAA ATC TTG CTT AAC GTA CAA GAG 17 3 

Met Ala Ser Lys He Leu Leu As.n Val Gin Glu 
. -35 -30 

GAG GTG ACC TGT CCC ATC TGC CTG GAG CTG TTG ACA GAA CCC TTG AGT 221 
Glu Val Thr Cys Pro He Cys Leu Glu Leu Leu Thr Glu Pro Leu Ser 
-25 -20 -15 



CTA GAC TGT GGC CAC AGC CTC TGC CGA GCC TGC ATC ACT GTG AGC AAC 269 
Leu Asp Cys Giy His Ser Leu Cys Arg .Ala Cvs He Thr Val Ser Asn 
-10 -5 1-5 

AAG GAG GCA GTG ACC AGC ATG GGA GGA AAA AGC AGC TGT CCT GTG TGT 317 
Lys Glu Ala Val Thr Ser Met Gly Gly Lys Ser Ser Cys Pro Val Cys 
10 15 20 

GGT ATC AGT KAC TCA KTT GAA CAT CTA CAG GCT AAT CAG CAT CGG 362 
Gly He' Ser Xaa Ser Xaa Glu His Leu Gin Aia Asn Gin His Arg 
25 30 35 



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

Ui) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

ik) ORGANISM: Homo Sapiens 
TISSUE TYPE: Testis 

i\:<: 7-at:ir?:: 
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(A) NAME/ KEY: sig_pepzide 
(3) LOCATION: 46. ,89' 

(C» IDENTinCATION METHOD: Von He\3ne matrix 
(D) OTHER INrORt^IATION: score 4 

seq YYMVCLFFRLrrS/EH 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



AGGAGATAGC CTCGTAGAAA TGACAACCAC AATGTTAATA CTAACAT AT3 TAT TAC 56 

Met Tyr Tyr 

ATG GTT TGT T7G ?TC TT7 CGC TTA ATA TTT TCA GAG CAC CTA CCT ATT 104 
Met Val Cvs Leu Phe Pne Arg Leu lie Phe Ser Giu His Leu Pro lie 
-10 " -5 15 

ATA GGC ACT GTC ACT TGT CAC AAA ACT GGG ACA GGG 140 
lie Gly Thr Vai Thr Ser His Lys Thr Giy Thr Gly 
10 15 



(2) INFORi^.ATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 158 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 15.7l22 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4 

seq KLAGLWSPGLVPA/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 



AAGTGTCGCG ATAA ATG GGC GCC GGC GGA SGG AGG GAG ATC CGA GCG GCG 50 
Met Gly Ala Giy Giy Xaa Arg Giu lie Arg Ala Aia 
-35 -30 -25 

GCG GCA AGC TGG CTG CGA GCG CCT GAG CAC TCC AAG CTC GCC GGC CTT 93 
Aia Ala Ser Trp Leu Arg Aia Aia Giu His Ser Lys L«u Ala Giy Leu 
-20 -15 -10 

TGG TCT CCA GGA CTT GTC CCA GCA GCC CCT CGA ACT GAG AAT TAC ACC 14 6 
Trp Ser Pro Giy Leu Vai Pro Aia Aia Pro Arg Thr Giu Asn Tyr Thr 
-5 1 5 

ATC GGA ZZZ CTG . 158 

lie Giy Pro Leu 
10 
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■2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A; LENGTH: 291 base pairs 

(3} TYPE: NUCLEIC ACID 

(C) S7RANCEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kono Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

{A) NAME/KEY: sig__peptide 

(B) LOCATION: 52.. 231 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4 

seq LVRRTLLVAALRA/WM 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 



AAGGAAACAG CAACCAGAGG GAGATGATCA CCTGAACCAC TGCTCCAAAC C ATG GGC 57 

Met Giy 
-60 

ACT AAA TGC TGT AAA GGT GGT CCA GAT GAA GAT GCA GTA GAA AGA CAG 105 
Ser Lys Cys Cys Lys Gly Giy Pro Asp Glu Asp Aia Val Glu Arg Gin 
-55 -50 -45 

AGG CGG CAG AAG TTG CTT CTT GCA CAA CTG CAT CAC AGA AAA AGG GTG 153 
Arg Arg Gin Lys Leu Leu Leu Aia Gin Leu Kis His Arg Lys Arg Vai 
-40 -35 -30 

AAR GCA GCT GGG CAG ATC CAG GCC TGG TGG CGT GGG GTC CTG GTG CGC 201 
Lys Ala Aia Giy Gin lie Gin Aia Tr? Trp Arg Gly Val Leu Val Arg 
-25* -20 -15 

AGG ACC CTG CTG GTT GCT GCC CTC AGG GCC TGG ATG ATT CAG TGC TGG 24 9 
Arg Thr Leu Leu Vai Aia Aia Leu Arg Ala Tro Met lie Gin Cys Trp 
-10 -5 1 5 

TGG AGG ACG TTG GTG CAG AGA CGG ATC CGT CAG CGG CGG CAG 291 
Trp Arg Thr Leu Val Gin Arg Arg lie Arg Gin Arg Arg Gin 
10 15 20 



(2) INFORMA.TION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) STRANDEDNESS: DOUBLE 
(0} TOPOLOGY: LINEAR 
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iii-'" MCLECULH: TYPE: CDNA 

(vi; ORIGINAL SOURCE: 

lA) ORGANISM: Homo Sapiens 
{?) TISSUE TYPE: Ovary 

(i.x) FEATURE: 

(A; MAME/KEY: sig^peptide 
(3} LOCATION: 103. .180 

(C: IDENTIFICATIOM METHOD: Von Heijne matrix 
(D) OTHER INFORKPiTION: score A 

seq SIHSWQLLTSAQP/QQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



CGGCAACGCG CGGCGGCTCA ACGGCCTGGC AGAGGTTTCA GCGCTGAGCA GGCCTGAGTT 60 

CCGTCATGGC CCTCTATTAT GACCACCAGA TAGAA.GCCCC GG AT3 CAG CAG GGT 114 

Met Gin Gin Giy 
-25 

CAC CCT CAT TTA TCA GCT GGC ACC CTG TCC ATC CAT TCT TGG CAG TTG 162 
His Pro His Leu Ser Ala Giy Thr Leu Ser lie His Ser Trp Gin Leu 
-20 -15 -10 

CTT ACA TCA GCA CAA CCT CAA CAG GCA GGG 192 
Leu Thr Ser Aia Gin Pro Gin Gin Aia Giy 
-5 1 



(2) INFORMATIOM FOR SEQ ID NO: 177: 

{ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

{ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(D) LOCATION: 1..14 7" 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4 

seq ATCCLSLFQWCAV/LR 

(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



ATG TCT AGA TAT GAG TMA GGA TCC TCC TTA TTG 
Xet Ser Arg Tyr Giu Xaa Giy Ser Scr Leu Leu 
-45 -40 



CCA TTT CCA GAC CAT 
Vro Vhe Pro Asp His 
-35 



43 



TTC 



rAC TCC TTT AAA ASA RAT AGT TTT TTT GAA CCC TAC ACC 



06 , 
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Phe Ser Val Ty; Ser Phe Lys Xaa Xaa Ser ?he Phe Glu Ala Tvr Ser 
-30 -25 -20 

ATT TCA GAT TAT GCC ACC TGC TGT CTC TCC TTA TIT CAG TGG TGT GCA 
lie Ser Asp Tyr Aia Thr Cys Cys Leu Ser Leu Phe Gin Tro Cvs Ala 
-15 -10 -5 



GTT CTG AGA TTC CTG TGT CTG CCC CTT CCG 
Vai Leu Arg Phe Leu Ser Leu Pro Leu Pro 
1 5 



174 



(2) INFORMATION FOR SEQ ID MO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNE3S: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

lix) FEATURE: 

(A) NAt<E/KEY: sig_peptide 
(3) LOCATION: 140.. 211 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

seq LLLHHYLLLFITT/SR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 



ACAGTTGTGG CTCTCAACTC TCCTTTTTGT GTACTGCTAT ACTTGAGTAG CACACAGCCA 60 

TACCAATT7C CAGGGTGCTC AGATTCATTC TACCCTTTCC TACTGGAAGA GGTAAAAAAG 120 

CAACACCCTA GAATCTGAT ATG ATT TAT TTT ATC AAA ATA AAC AAT AAG CTA 172 

Met lie Tyr Phe lie Lys lie Asn Asn Lys Leu 
-20 -15 

CTG CTT TTG CAC CAT TAC TTG CTT CTA TTT ATA ACA ACC TCT CGC CCC 220 
Leu Leu Leu His His Tyr Leu Leu Leu Phe lie Thr Thr Ser Arg Pro 
-10 -5 1 

ACA GGG 226 
Thr Giy 
5 



{2i INFORMATION FOR SEQ ID NO: 179: 

(i: SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 base pairs 
{B) TYPE: NUCLEIC ACID 
:C) STRANDEDNE33: DOUBLE 
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(D) TDPOLOGV: LINEAR 

(ii) MOLECULE TYPE: CDKA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE; 

(A) NAME/KEY: sig_pepcide 

(B) LOCATION: 28.. 108 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

seq LSWALCLSQSGYY/HP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 



AGGGTATATT TCNTGTCCCC TAGGAGC ATG GAG CTT TTG TAC CTT AAA GTT AAG 54 

Met Giu Leu Leu Tyr Leu Lys Val Lys 
-25 -20 

AGA GGA CAA AAG GAT CTG AGC TGG GCT :T3 TGC CTT TCC CAG AGT GGT 102 
Arg Giy Gin Lys Asp Leu Ser Trp Ala Leu Cys Leu Ser Gin Ser Gly 
-15 -10 -5 

TAT TAC CAC CCT TCC CAC CCC CAT TGG 129 
Tyr Tyr His Pro Ser His Pro His Trp 
1 5 



(2 J INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 36.. 77 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

seq TLAVTLSALGATG/LF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 



AGAGGCCAGA T7MTGCAGGC CTGTGGGCTG ACACA ATG ACT TTG GCT GTT ACT 53 

Met Thr Leu Ala Val Thr 
-10 

CTG AGT GCA TTG GGG GCC ACC GGA TTG TTT i.a.G v':;AG GCT TGT G.AT CTA 101 
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Leu Ser Ala Leu Gly Ala Thr Glv Leu 
-5 ' 1 

ACC TTT TTA AAC ATA GOT CAG ATC ACA 
Thr Phe Leu Asn lie Giy Gin He Thr 
10 15 

GGC CCC CAG 
Glv Pro Gin 
25 
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Fhe Lys Glu Ala Cys Asp Leu 



AGC YTC CTT PAA CAA TCC GGT 14 9 
Ser Xaa Leu Lys Gin Ser Giy 
20 

158 



(2) INFORMATION FOR 3EQ ID NO: 181: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOaSLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 1157.237 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 3.9 

seq CRCLITLPRSCRP/ST 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 



ATTTGACGTG TCTGTTTCAT GTYTCCTTTG AGTAAAACCT AATCTTTCTC AATAGAGAAG 60 

TTTATTCTTG AAGTATGTGK KCTCAGTTCA TTCSCCTGAG TGACACAAGC TCCC ATG 117 

Met 

CTT GGG CCA CCC TTG CAG CCC GGA AGC CAT GGG AAG GTC CTC GCC CCT 165 
Leu Giy Pro Pro Leu Gin Pro Gly Ser His Giy Lys Val Leu Ala Pro 
-40 -35 -30 -25 

CAG GGC AGT AGT GGC CTG ACA CCC CCC TIC COG TGC AGG TGT CTG ATA 213 
Gin Gly Ser Ser Gly Leu Thr Pro Pro Phe Pro Cys Arg Cys Leu lie 
-20 -15 -10 

ACT CTG CCG CGG TCG TGC CGG CCC AGT ACA 7CT GTG CCC CGG RCA GCA 261 
Thr Leu Pro Arg Ser Cys Arg Pro Ser Thr Ser Val Pro Arg Xaa Ala 
-5 1 5 

AGC ACA CGT TCC TCG CAG CGC CCG SSC AGC TCC TGC TGG MGA AGT TCC 309 
Ser Thr Arg Ser Ser Gin Arg Pro Xaa Ser Ser Cys Trp Arg Ser Ser 
10 15 20 



TGC AGC ACC ACA GCC ACC ATG 
Cys Ser Thr Thr Ala Thr Met 
25 30 



330 
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(2) INFORMATION FOR SSQ ID NO: 182: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: D0U3LE 

(D) TOPOLOGY: LINEAR 

(il) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 64.. 144 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.8 

seq QLXLILVHFPAYS/VE 

fxi) SEQUENCE DESCRIPTION: SEQ IP NO: 182: 



AA.i^CTGCCAT CYGCAACTGA ACTTTGGCAG TAAACACAGC TTAGTTGTCT CAGAGGATTC 60 

ACA ATG GGA AAT GTT TGT AGT TGC TGC CTC AGA GCA AG A TAT CAR CAG 108 
Met Giy Asn Val Cys Ser Cys Cys Leu Arg Ala Arg Tyr Gin Gin 
-25 -20 -15 

TIG DCT TTA ATT TTA GTT CAT TTC CCA GCA TAT TCT GTT GIKA GAT CAA 156 
Leu Xaa Lsu lie Leu Val His Phe Pro Ala Tyr Ser Vai Glu Asp Gin 
-10 -5 1 

AGA GTG G.^T CCT GGG GTG CCA GGG GAA TCC ACC GTC TGC CAC CAC AAT 204 
Arg Val As? Pro Gly Val Pro Gly Glu Ser Thr Vai Cys His His Asn 
5 10 15 20 

CGG ' 207 
Arg 



(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 base pairs 

(B) TYPE: NUCLEIC AGIO 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Komo Sapiens 
(F) TISSUE TYPE: Spleen 

(i>:; FEATURE: 
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-- (A) NAME/KEY: sig_neptide 
(B) LOCATION: 9.. 70 

iC) IDENTIFICATION METHOD: Von H5»ijn?* x^r-ix 
(D) OTHER INFORMATION: score 3.3 

seq PRCVISCIKGVWC/EE 

■xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 



ACTCTGC ATG CTT TAT GGC CTT GGC TCT GGG CCA AGG TGT GIG ATC TCC 4 9 

Met Leu Tyr Gly Leu Gly Ser Gly Pro Arg Cys Val I»» S»r 
-20 -15 -10 

TGC ATT CAT GGT GTG TGG TGT GAG GAG GGG GAT GGG TCC CTG CCC CGT 9"? 
Cys :ie His Gly Val Trp Cys Glu Glu Gly Asd Gly Ser Leu Pro Arg 
-5 1*5 

CTG CAC GTG GCC CTC ATG ATT CCC GCG CTA GGG 130 
Leu His Val Ala Leu Met He Pro Ala Leu Gly 
10 15 20 



(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A} NAME/KEY: sig peptide 

(B) LOCATION: 62.7X87 

(C) IDENTIFICATION METHOD: Vor. Heijne matrix 

(D) OTHER INFORMATION: score 3.8 

seq VTPLDSCPPSAHS/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 



ACAGCTTCCA CTCTTGTCTC CCTAAACCCT GTTTTCCTCA CAGTAACTAG AATTGTCCTT 60 

A ATG CAT AGA ATC ATG ACT CTC CTT CAT CTC AAA GCT CTC CAA CAG CTT 109 
Met His Arg He Met Thr Leu Leu His Leu Lys Ala Leu Gin Gin Leu 
-40 -35 -30 

CAG AAT AAA ATC CAT GTC CCC AGG ATG CTC CCA GGG CCT GTG ACC CCT 157 
Gin Asa Lys lie His Val Pro Arg Met Leu Pro Gly Pro Val Thr Pro 
-25 -20 -15 

CTG GAC TCA TCC CCT CCT TCT GCT CAT TC7 GCT CCA TCA CTG CT:: ACT 205 
Leu Asp Sar Cys Pro Pro Ser Ala His Ser Ala Pro Ser Leu Leu Thr 
-10 -5 I 5 
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?CC CAG GTA CCC CTC C.^lA CAC ACC AAT GCG CrC CCA CCT CAC GGC C7C 



Ser Gin 



'ro Leu Gin His 

10 



Thr Asn Ala Pro Pro Pro His Giy L€ 
l=> 20 



TCC CTG CGC CGT GCC CTC CAC TGG ATT GCC CTT CCC T?G ATG GGG 
Ser Leu Arg Arg Ala Leu His Trp He Ala Leu Pre Leu Mer Giy 

30 35 



:93 



25 



(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CONA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F; TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 93.7131 

(CI IDENTIFICATION METHOD: Von Heijne natrix 
(D) OTHER INFORMATION: score 3.8 

seq MLFLVLFYSAIFL/FT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 



AAACAACAAA AAAAAGTTTA AAAATTGGAA ACCACCAAAA GGTAGTATTA AAAGGGAAAT 60 

AAAAATTACT CATAATCCCA GAACGCAGTC AT ATG CTA TTT TTA GTC TTA TTT 113 

Met Leu Phe Leu Val Leu Phe 
-10 

TAT TCA GCC ATT TTT CTC TTT ACA CTA ACT TTT TTT 14 9 

Tyr Ser Ala lie Phe Leu Phe Thr Leu Thr Phe Phe 
-5 15 



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(r) TISSUE TYPE: Ovary 

{ixl FEATURE: 
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(A) NAT-IEI/KEY: sig^peptide 
-- (3) LOCATIOM: 133/. 174 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
{D) OTHER INFORMATION: score 3.8 

seq VSLCVAALFPLQA/rG 

(xii SEQUENCE DESCRIPTIOM: SEQ ID NO: 186: 



AAAGAGTACC TG.AAAACCTT AGAGAACCCT GGGGAAATAT TTATAGCCAG GCTTCTTGGA 50 

GACTCTGGGA ACAGGAAAGT CAGGAACCCT GCCTTTCAGG AACTGCTGTA TC7CAGTCGM 120 

MTTCTTCATT TC ATG GTT TCT CTC TGT GTA GOT GC7 T7A TTT CCT CTT CAG 171 

Met Vai Ser Leu Cys Val Ala Ala Leu Phe Pro Leu Gin 
-10 -5 

GCT TAG GGG 180 
Ala Tyr Gly 



(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 base oairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEONESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME /KEY: sig^peptide 

(B) LOCATION: 218.. 268 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.7 

saq LFYIPSILTLLLA/CR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 



AAAATTCTTC CASAATGCTA ATGTAAATCT AATCAGCCTT TAGAATTTAA AGGCTTAAAA 60 

AAGACTAAAG AAAAGTAACA ACCAAATGCA ATATGTAGAA CTTATATGGA GCCTGATTCG 120 

AACATCAAGT ATAAAGAGA.T ATTTTTGAGA AAATTGAGAA ATTTTAAAAC ATGAMATBAG 180 

TATTATATGA TATTGAMGAC TGCTGCTTTT TCAMGAC ATG TCC TCA AAT TTA TTT 235 

Met Ser Ser Asn Leu Phe 
-15 

TAC ATT CCT TCC ATA CTA ACT CTT CTC CTT GCA TGT MGA CAG ACA GGG 28 3 
Tyr He Pro Ser lie Leu Thr Leu Leu Leu Ala Cys Arg Gin Thr Gly 
-10 -5 15 
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{2'» IMrORMATION FCR SEQ TD NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 121 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: D0U3LE 

(D) TOPOIOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(i:<) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 2.. 106 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.7 

seq IKQFILCLGTCRG/EM 

(xi) SEQUENCE DESCRIPTION: SEQ ID KO: 188: 



T ATG GGG CTT TTG AGA AAG TGT TTT CCC GTG ATG CTG GGG GGA AAC ACA 4 9 
Met Gly Leu Leu Arg Lys Cys Phe Pro Vai Met Leu Gly Gly Aan Thr 
-35 -30 -25 -20 



CAT ATT CAA ATT ACT TGT ATA AAA CAG TTT ATT CTG TGT TTA GGA ACT 
His He Gin He Thr Cys He Lys Gin Phe He Leu Cys Leu Gly Thr 
-15 -10 -5 



97 



TGT AGG GGT GAA ATG CTG ACC AGG 
Cys Arg Gly Giu Met Leu Thr Arg 
1 5 



121 



(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 56.797 

(C) IDENTIFICATION METHOD: Voa Heijiie matrix 

(D) OTHER INFORMATION: score 3.7 

seq MLPLFCSPV7ESGG/RT 
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(xil_ SEQUENCE DESCRIPTION: SEQ ID NO: 189: 



TAAGCCGAGA .CiACTTCCGTA CTGTGTTAAA AACTGTTTGA GGAACAC7GC ATTAA ATG 56 

Met 

ATG CTT CCA CTG TTC TGC TCT CCC TGG GAA AGC GGA GGC AGA ACG GTG 106 
Met L9U Pro Leu Phe Cys Ser Pro Trp Giu Ser Gly Giy Arg Thr Val 
-10 -5 1 

AAG CAG AGT GAA GGN YCT TGT TWA TTC CAG GCC CCC CAT GGG 14 8 

Lys Gin Ser Glu Gly Xaa Cys Xaa Phe Gin Ala Pro His Giy 
5 10 15 



(2) INFORMATION FOR SEQ ID NO: 190: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14.0 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
<F) TISSUE TYPE: Uterus 



(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 27.. 71 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.7 

sec KLLSDLSVDSARC/K? 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 



ATCTTAACAG AACTTTACAG ACTAGC ATG GCA AAG CT? CTC TCC GAT CTT AGT 53 

Met Ala Lys Leu Leu Ser Asp Leu Ser 
-15 -10 



GTG GAC AGT GCT CGC TGC AAG CCT GGG AAT AAC CTT ACC AAA TCA CTC 101 
Val Asp Ser Ala Arg Cys Lys Pro Gly Asn Aan Leu Thr Lvs Ser Leu 
-5 1 5 ' 10 



TTG AAC ATT CAT GAT AAA CAA CTT CAA CAT CAC CCA CCG 
Leu Asn lie His Asp Lys Gin Leu Gin His Asp Pro Arg 
15 20 



140 



(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: NUCLEIC ACID 
;C) STRANDEDNESS: DOUBLE 
:D) TOPOLOGY: LINEAR 
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MOLECULE TYPE: CDMA 

;vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
iT) TISSUE TYPE: Testis 

(ix) FEATURE: 

.;A) NAME/KEY: sig_peptide 
(5) LOCATION: 199.. 252 

(C) IDENTIFICATION MSTHOO: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.7 

seq VCWGHLLPARVST/RS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 



AGAGAYCAAC TCTATTTGAG CAA3AGTKAG GAAGATTTCC CTGTCTCCCA GCTGAGTAAC €0 

CACTCAGGTT TATTTAAATC CAGTTTAAAT ATGGTTTCAG TAATGATTTT CCAATGGTCT 120 

ACAGCAAAGA ATGGTGCTCC AAGCCTGAAC ATTGAGCACG ACCCAGGTCA TATGCACAAC 180 

ACGACAGGTT GAGCGTCC ATG TGT GGC TAC TGG GTT TGC TGG GGA CAC CTC 231 
Met Cys Gly Tyr Tro Val Cys Trp Gly His Leu 
-15 -10 

TTG CCT GCC AGG GTG AGC ACA CGC AGC AGT GAG CAG CCC CGT GTG ACC 279 
Leu Pro Ala Arg Val Ser Thr Arg Ser Ser Glu Gin Pro Arg Val Thr 
-5 15 

CCA CGG GAT GAG GAT GCC ATG ATG TCA GCA TCC CTT CTG ACT TGG AGG 327 
Pro Arg Asp Glu Asp Ala Met Met Ser Ala Ser Leu Leu Thr Trp Arg 
10 15 20 25 

TAT GTG ACA TTC ATG GTG CCA ATG CCA CTG TCA CCT TGC AGA TCA GTC 375 
Tyr Val Thr Phe Met Val Pro Mat Pro Leu Ser Pro Cys Arg Ser Val 
30 35 40 

TGG GTT TGC TTC AGA CAG AAG ATC CTG ,GAA TAT GTT CAN GCA 417 
Trp Val Cys Phe Arg Gin Lys He Leu Glu Tyr Val Xaa Ala 

45 50 55 



(2) INFORMATION FOR SEQ ID NO: 192: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 base pairs 
(0) TYPE: NUCLEIC ACID 
(C) STRANDEDNE3S : DOUBLE 
(DJ TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) 0R:GINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{?) TISSUE TYPE: Uterus 

(iz) FEATURE: 

(A) NAME/KEY: sig_peptic:e 
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(B) LOCATION; 65.. 137 

(C) IDENTIFICATION METHOD: Von Heiine matrix 

(D) OTHER INFORMATION: score 3.7 

seq AILGLSTFLNLLS/IN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 



AGCTGC3CAC AGATSATTGA ATTGCGGGGT TGCTGTAGGA ACCGCTGCTA TTGCCGCAGG 60 

AGGAG ATG AAG TTA TCT TGT GCA GGC TGT GCA GAC ACA GCC ATT TTG GGA 110 
Met Lys Leu Ser Cys Ala Gly Cys Ala Asp Thr Ala He Leu Gly 
-20 -15 -10 

CTC AGC ACT TTC CTT AAT TTA CTT TCC ATC AAC CTG C7C GGA ATG ATT 158 
Leu Ser Thr Phe Leu Asn Leu Leu Ser He Asn Leu Leu Gly Met He 
-5 15 

TCT TTC TCT 167 
Ser Phe Ser 
10 



(2) INFORMATION FOR SEQ ID NO: 193: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 
(3) TYPE: NOCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME /KEY: sig^peptide 
(3) LOCATION: 75.. 137 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 3.7 

seq FSLGSCPAGPLSA/CV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



ATGTCACATT TAASNAGAGG CCAGAGCTTG TCCAAAATGG CTGTCCRWAM ACGACCCCAC 60 

ACTTGCGTTA GAAG ATG ATA CCT TTT TCA GGG ACA GTT TTC TCT CTT GGC 110 
Met He Pro Phe Ser Gly Thr Vai Phe Ser Leu Gly 
-20 -15 -10 

TCC TGT CCC GCT GGC CCT CTG TCT GCC TGT GTC CCT GAC CAT GGC TCC 153 
Ser Cys Pro Aia Gly Pro Leu Ser Ala Cys Val Pro Asp His Gly Ser 
-5 1 5 

CTG CAG TAC CCT TTA ACG ATT TAT CAG CAA G.=.C TGT KG A ACG CAT AR3 206 
Leu Gin Tyr Pro Leu Thr He Tyr Gin Gin Asp Cys Kaa Thr His Kaa 
10 15 20 
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TGC CCA AG A TGC CTG TCC CTC CCC CTC CAG CAC CCC CGA CA3 248 
Cys Pro Arg Cys Leu Ser Leu Pro Leu Gin Kis Pro Arq Gla 
25 30 35 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

{ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hono Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 70.. 174 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.7 

seq PAV3LSAPAFASA/LR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 



AGAGGACAGG GTAGAGTCGC AGAAAGGAGA GACACACATA C.ATGGAAAGA GGAGCYTTCT 60 

CCAATCTTA ATG ATT CCC AGC TCT CAG CCT CGT TTC TGM AAC CCA GCC TGC 111 
Met He Pro Ser Ser Gin Pro Arg Phe Xaa Asn Pro Ala Cys 
-35 -30 -25 

AAG CAA ACT GTC CTG CTT WGG GAC CCT GCT GTG TCA CTC TCC GCA CCA 159 
Lys Gin Thr Val Leu Leu Xaa Asp Pro Ala Val Ser Leu Ser Ala Pro 
-20 -15 -10 

GCC TTT GCC TCT GCT CTT CGC TCT ATG AT-IG TCC TCC CAG GCT GCA CGG 207 
Ala Phe Ala Ser Ala Leu Arg Ser Met Xaa Ser Ser Gin Ala Ala Arg 
-5 1 5 10 

AAG GAC GAC TTT CTC AGG TCT CTT AGT GAT GCA QTiC TCA GGG ACA TCA 255 
Lys Asp Asp Phe Leu Arg Ser Leu Ser Asp Gly Asp Ser Giy Thr Ser 
15 20 25 

GAA CAC ATC TCA GCG GTG GTG ACT AGC CCT CGG ATT TCC TGC CAT GGT 303 
Glu His He Ser Ala Val Val Thr Ser Pro Arg lie Ser Cys His Giy 
30 35 40 

GCT GCC ATT CCC ACC GCC CGT GCC CTC TGC CTA YGC TGT TCC TGC TGC 351 
Ala Ala lie Pro Thr Ala Arg Ala Leu Cys Leu Xaa Cys Ser Cys Cys 
45 50 55 



ACC GAA CGC 
Thr Giu Arg 
60 



360 
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(2) INFORMATION fOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNE3S : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 161.. 205 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.6 

seq PTFLLISDSFLTS/QP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 



ATTGGCCTGC TCTTCCTGAT ACCTACTTGG TCACTACTTA ATTACATTTT GTTTGTGTAT 60 

CTTTTTTCTT CAGGCTGTAA ATTCTCTAAA GGCATTTTGC TTATTTTGGT GTCACAATTG 120 

TTTAGGCCAT GCGCCTAGGT CTTCTTAAAA CACCTCTCTC ATG GCT CCT ACT TTT 175 

Met Ala Pro Thr Phe 
-15 

CTA CTT ATT TCT GAT TCT TTT CTG ACT TCT CAG CCT TCT TTT TTT TTT 223 
Leu Leu lie Ser Asp Ser Phe Leu Thr Ser Gin Pro Ser Phe Phe Phe 
-10 -5 1 5 

TTT 226 
Phe 



(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 2197.275 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
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:D) other IMFORMATICN: score 3.5 

seq LSLLGIKIQWCLS/EN 

■xi: SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

AAAAAATACA CKGAGATTA? G7ACGATTTA GTGATTTGGT GGGATAATTA TAAATCGTGG 60 

AATAATTTAT ATATGTGGAG TAAGAAGAGA GGGGTCAAAC CTTTTGGTAC AAGCAACATC 120 

TTGTTGCCAC CACCTTGATT TTCTCATAGG TGCTATTGTG TCCTAAGAGT RGRACAGRSR 180 

RGRAAACAAA GATAATTAAA CACAAGTCAG GTTACAAC ATG ATA TCT ?TA ATT GTA 236 

Met lie Ser Leu lie Val 
-15 

CTT TCT CTG CTT GGT ATC AAG ATT CAG TGG TGC TTG TCA GAA AAT ACC 284 
Leu Ser Leu Leu Gly lie Lys lie Gin Trp Cys Leu Ser Glu Asn Thr 
-10 -5 1 

TTG TTC TGT GAC TCT GAC TAT CTC TTG AGT CCC AAG GCT CCA ATT GAG 332 
Leu Phe Cvs Aso Ser Asp Tyr Leu Leu Ser Pro Lys Ala Pro lie Glu 
5 ' ' 10 15 

CCT TTA TCT TTC PAC CTT ACC ACC CAG GGG 362 
Pro Leu Ser Phe Asn Leu Thr Thr Gin Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 197: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: oig_pcptide 

(B) LOCATION: 129.. 257 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.6 

sec LLYR^'TFLPRKVA/RV 

(zi) SEQUENCE DESCRIPTION: SEQ ID NO: 197; 



ATTAAGTCCT GCATTTTGTA AGAGGCAAAT GGAGAGTAAC AGAAGAGTGT CTTTTCTCCT 50 
GGTTTTGGAG TCTTGCACTG GCCATGAGTG TTGKGACTGA TGGTCRACCC AGGCGGGCAT 120 



TTTAJkTAA ATG GCC TGT GAT TCT TTT TTG AAA G.a.T GCT CTT CCA CrV. GAG 17 0 
Met Aia Cys Asp Ser Phe Leu Lys Asp Aia Leu Pro Gin Glu 
-40 -35 -30 
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TTA AGC CA3 TTA GRR TTC C?G TTT CCT CTT 3TT GAT ATG A3A GA.A GAT 218 

Leu Ser Gin Leu Xaa ?he Leu Phe Fro Leu Val As? Met Arg Giu Asp 
-25 -20 -15 

CTC CTG TAT TTT AAC ACG TTT TTA CCA AGA AAG GTG GCA CGG GTG 253 

Leu Leu Tyr Phe Asn Thr Phe Leu Pro Arg Lys Vai Ala Arg Vai . 

-10 -5 ' 1 



(2) Ii^FORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

fA) LENGTH: 216 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hono Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 13. .171 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.6 

seq FLILHFFPQQIRK/KI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 



ATTCTCAATC PA ATG TTG CTA TTG AA.T GAA AAC CTC PJ^A GCA GAA ATT CPA 51 
Met Leu Leu Leu Asn Glu Asn Leu Lys Ala Giu lie Gin 
-50 -45 

AAG AAT GAA GCT CAA GGC AGC TGT ATC TTG TTT CTG TTT TGC TTT GAG 99 
Lys Asp Giu Ala Gin Gly Ser Cys lie Lej Pne Leu Phe Cys Phe Giu 
-40 -35 -30 -25 

AGT CAG AAT ATG CGA TCA AAA TCT ATC TTC CCC TTC CTT ATC CTT CAT 147 
Ser Gin Asn Met Arg Ser Lys Ser He P.^e Pro Phe Leu lie Leu His 
-20 -lo -10 

TTT TTT CCC CAG CAG ATA AGA AAA AAA ATA GTG GTG CTT CTT TTA GGA 195 
Phe Pne Pro Gin Gin lie Arg Lys Lys lie val vai Leu Leu Leu Gly 
-5 1 5 



CTT AAT TCT CAG AAG GCA GGG 
Leu Asp. Ser Gin Lys Aia Gly 
10 15 



216 



(2) IMFORMATION FOR SEQ ID NO: 199: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 base pair.s 
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(B) TYPE: NUCLEIC ACID 
-- iC) STRANDEDNESS : DOUBLE 
{D) TOPOLOGY: LINEAR 

MOLECULE TYPE: CDNA 

■vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

iix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 6., 83 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LLPFTFLSLKAFL/QX 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 199: 



AGCTG ATG ATA AGT AAG TAT GTG CAT TAT AGC TTG ACT GAC TTA CTA TTA 50 
Me- lie Ser Lys Tyr Val His Tyr Ser Leu Thr Asp Leu Leu Leu 
-25 -20 -15 

CCT TTT ACA TTC TTA AGC CTT AAA 6CC TTT CTG CAG YYA AGA GTT TTA 98 
Pro Phe Thr Phe Leu Ser Leu Lys Ala Phe Leu Gin Xaa Arg Val Leu 
-10 -5 I 5 

ATG TCT CTT CCT CAA CAC AAG CCC TGG 125 
Met Ser Leu Pro Gin His Lys Pro Trp 
10 



(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 base pairs 

(S) TYPE: NUCLEIC ACID 

(C) STRANDCDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 42.-122 

CO IDENTIFICATION METHOD: Vnn Hpijn« matrix 
(D) OTHER INFORMATION: score 3.5 

seq CSLLSSFCALHFG/LK 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 



A.-.TAr37GAT CAAACGCCCA GGAGCCAGCT GGTGASAAAG A ATG GCG AGG ACA ATG 56 

Met Ala Arg T.hr Met 
-25 
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GGA GTT CCC AGA GCA TGC AAG GCC TTC TGI AGC CTC CTZ TCC AGC TTC 104 
Gly Vai Pro Arg Ala Cys Lys Ala Phe Cys Ser Leu Leu Ser Ser ?he 
-20 .15 -10 

TGI GCA TTA CAC TTT GGG CTC AAG AAJV CAG TAT GGT ACT TC? TAC CTC 152 
Cys Aia Leu His Phe Giy Leu Lys Lys Gin Tyr Gly Thr S^r Tyr Leu 
-5 1 5 10 

CAT GCC TGT GCT TAT GOT AGC CCC TTG AGC TGG GGT CCC TGG 194 
His Ala Cys Ala Tyr Ala Ser Pro Leu Thr Tro Gly Pro Trp 
15 20 



(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SSQUENC£ CHARACTERISTICS: 

(A) LENGTH: 348 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 262.. 306 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LCFLLPHHRLQEA/RX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 



ATTTCGCGGC GCTCGCBGMA CYHSGWTGTT CAGCACCTTC GGTCCGGTTG AGGTTGTCAA 60 

GTCGGMCCA.A ACAGGTTGTT TCTCTGCAGT TTCCAACATG GCAGGGMSGT TTA-ATAGACA 120 

TGG.ATAAG.AA GTCCACTCAC AGAAATCCTG AAGATGCCAG GGCTGGCAAA TATGAAGGTA 130 

AACACAAACG AAAGAAAAGA AGAAAGCAAA ACCAAAACCA GCACCGATCC CGACATAGAT 2 40 

CAGTGACGTC TTTTTCTTCA G ATG ATC CTA TGT TTC CTT CTT CCT CAT CAT 291 

Met lie Leu Cys Phe Leu Leu Pro His His 
-15 -10 

CGT CTT CAG GAA GCC AGA YAG ATT CAA GTA TTG AAG ATK CTT CCA AGG 339 
Arg Lau Gin Glu Ala Arg Xaa He Gin Vai Leu Lys Xaa Leu Pro Arg 
-5 1 5 10 

GAA AAA TTA 34 8 

Glu Lys Leu 



:MroRr^.AT:oM for seq id no: 202: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 55 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : EX}UBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_pepcide 

(B) LOCATION: 7.. 84 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq QCFFVCFSPKIYG/VI 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 



AATAAC ATG CAA GAC TAC GTT TCA CAT GCA GTA CGG CGC CAC TGT CAG 4 9 

Met Gin Asp Tyr Val Ser His Ala Val Arg Arg His Cys Gin 
-25 -20 -15 

TGT TTT TTT GTT TGT TTT TCC CCC AAG ATT TAT GGC GTA ATA ACA TGG 96 
Cys Phe Phe Val Cys Phe Ser Pro Lys He Tyr Gly Val He Thr Trp 
-10 -5 1 

ACC GTC C7G ATA ACT GGA GCC CGG GTT CTG TCA GAG CCC CAG AGG TTG 14 4 
Thr Val Leu He Thr Gly Ala Arg Val Leu Ser Glu Pro Gin Arg Leu 
5 10 15 20 

TGG GTT AGA CTT GAT GAC ATA ACA GCA AAT GCA GCG TGT GGT TAC AGA 192 
Trp Val Arg Leu Asp Asp He Thr Ala Asn Ala Ala Cys Gly Tyr Arg 
25 30 35 

AAG CAA GAG CCG CGG AAG ACG TTT GAA AAC AAT TGG GAA AAT TTG TAT 24 0 
Lys Gin. Glu Pro Arg Lys Thr Phe Glu Asn Asn Trp Glu Asn Leu Tyr 
40 45 50 

ACG GAC TGG AAC TGG 255 
Thr Asp Trp Asn Trp 
55 



(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS; DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vT; ORIGINAL SOURCE: 

(A) ORGANISM: homo Sapiens 
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(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 120T.212 

(C) IDENTIF:CATI0N method: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq VLLNLALSHFNNC/GL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 



AGATYTTATA ATCTTGCTAC AAAGAAAGTA GGACAGTCTC AGCCTTTAAG AA7GTCACTA 

TAACAGTTTT TTTTTTCCTT AAGGATATTT TAAACAGGAA AGTAGACAAC CGGGTAAGC 

ATG GAG TTT GCT CAT GCT GCC GAA TGT GTG TCT TTT GCC CTA AAT GAA 
Met Giu Phe Ala His Ala Ala Glu Cys Val Ser Phe Ala Leu Asn Glu 
-30 -25 -20 

ACG CAC GTT CTT CTA AAT TTA GCC CTA TCA CAT TTT AAC AAT TGT GGC 
Th.r His Val Leu Leu Asn Leu Ala Leu Ser His Phe Asn Asn Cys Gly 
-15 -10 -5 1 

etc GCA GTG 
Leu Ala Val 



(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 133.. 222 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LLAA5WLPRDAPC/EA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 



ACAGATTGCT TTCCAAGCTG AACATATGCA ACTGTATTGC TAAACTTACC AATTTCAGGG 

AATCTGGGCG TCAAAAGCAT CCACATCCCT GCAGCAGGCC CCTGGGGAGG TAGGCAGGGT 

GACAGCTGGG AA ATG CCR PAC CAG GGC TTT CCA TkC CTG TCT CCT TCT CTC 

Met Gly Asn Gin Gly ?.*:e Pro Tyr Leu Ser Pro Ser Leu 
-30 -25 -20 



60 
119 

215 
224 



60 

I2C 
171 
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AGT GTC CAG GAT CT? C?T GCT GCT TCA TGG CTG CCC CGA GAT GCT CCC 219 
Ser Val Gin Asp Leu Leu Ala Ala 3er Trp Leu Pro Arc Asp Ala Pro 
-15 -10 -5 

TGT GAG GCC CCC CCG GGC CTG CCT TCA CAG ACA ATG CTC TGT GCC GCT 267 
Cys Glu Ala Pro Pro Gly Leu Pro Ser Gin Thr Met Leu Cys Ala Pro 
15 10 15 

GGA CCA AGG 2'' 6 

Gly Pro Arg 



(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 68.Tl33 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq AQLASPLLPGATP/VA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 



AACAAATTGA TCTTGTGTGA TGAGTGTAAT AAAGCCTTCC ACCTGTTTTG TCTGAGGCCG 60 

GCCCTCT ATG AAG TAC CAG ATG GTG AGT G::C AGT GCC CAG CTT GCC AGC 109 
Met Lys Tyr Gin Met Val Ser Gly Ser Ala Gin Leu Ala Ser 
-20 -15 -10 

CCG CTA CTG CCA GGC GCA ACT CCC GTG GCA 3GA ACT ATA CTG AAG AGT 157 
Pro Lea Leu Pro Gly Ala Thr Pro Val Aia Gly Thr lie Leu Lys Ser 
-5 1 5 

CTG CTT CTG AGG ACA GTG AAG ATG ATG AGA GTG TAT GGG 196 
Leu Leu Leu Arg Thr Vai Lys Met Met Arg Val Tyr Gly 
10 15 20 



[2) INFORMATION FOR SEQ ID NO: 2C6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 base pairs 
(3) TYPE: NUCLEIC ACID 
(C; STPANDEDNESS: DOUBLE 
(D) TOPOLOGY: LINEAR 
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(ii) MOLECULE TYPE: C3NA 



{VI) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE; Testis 

(ix) FEATURE: 

(A) NAME/KEY: ocher 

(B) LOCATION: 55.. 94 

(C) IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: icencicy 100 

region 1 . . 40 
id AA134726 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 39.. 121 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 93 

region 34 . . 66 
id ;iJ\134726 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 72.. 140 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 95 

region 1 . . 69 
id r:7226 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 41.7l03 

(C) IDENTIFICATION METHOD: Von Hei3ne natrix 

(D) OTHER INFORMATION: score 12.7 

sec ILFLLSWSGPLQG/QQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 



AGTACGTTCC TTCTACTCTG GCACCACTCT CCAGGCTGCC ATG GGG CCC AGC ACC 55 

Met Gly Pro Ser Thr 
-20 

CCT CTC CTC ATC TTG TTC CTT TTG TCA TGG TCG GGA CCC CTC CAA GGA 103 
Pio Leu Leu Il« Leu Phe Leu Leu Ser Trp Ser Gly Pro Leu Gin Gly 
-15 -10 -5 

GAG CAG CAC CAC CTT GTG GAG TAC ATG GA.^. CGC CGA CAC GGG 14 5 

Gin Gin His His Leu Vai Glu Tyr Met Giu Arg Arg His Gly 
1 5 10 



:2: INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

{A; LENG?!!: 172 base pairs 
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(3) TYPE: MUCLEIC ACID 
(C) STRANDSDKSSS: DOUB-i 
{DJ TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 73.. 169 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 73 . . 169 
id W25639 
est 



(ix) FEATURE: 

(A) N.AME/KEY: other 

(B) LOCATION: 37. .81 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 91 

region 38 . . 82 
id W25639 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 42.. 169 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 128 
id AA040C16 
est 



(ix) FEATURE: 

(A) N.AME/KEY: other 
(3) LOCATION: 34.. 169 

(C) IDENTIFICATION METHOD: blastr. 

(D) OTHER INFORMATION: identity 96 

regica 2 3. ,158 
id R72515 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 47.. 169 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORt^ATION: idsnzicy 99 

region 1 . . 123 
id T64313 
est 



Ux) FEATURE: 

(A) H.-i'IE/KEY: sig_peptide 

(B) LOCATION: 86.Tl45 

(C) IDEMTIFICATION METHOD: Von Heijne matrix 
(C) OTHE?. IM FORMATION: score 7.4 

sec LVFCVGLLTMAK-=v/E3 
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:xi} SEQUENCE DESCRIPTION: SEQ ID NO: 207: 



AAGGGGGTCC AAAGTGC7CA GCCCCCGGGG CACAGCAGGA CGTTTG3GGG CCTTCTTTCA . 60 

GC^GG3GkCP'. GCCCGATTGG GGACA ATG GCG TCT CTT GGC CAC ATC T7G GTT 112 

Met Aia Ser Leu Gly His He Leu Vai 
-20 -15 

TTC TGT GTG GGT CTC CTC ACC ATG GCC AAG GCA GAA AGT CCA AAG GAA 160 
Phe Cys Vai Gly Leu Leu Thr Met Aia Lys Aia Giu Ser Pro Lys Giu 
-10 -5 15 

CAC GAC CCG AGG 172 
His Asp Pro Arg 



(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 
{31 TYPE: NUCLEIC ACID 
CO STPANDEDNESS: DOUBLE 
(D) TOPOLOGY: LINEAR 

iii) MOLECULE TYFE: CDNA 

(vi) ORIGINAL SOURCE: 

(A} ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A; NAME/KEY: Other 

(3) LOCATION: 46.. 192 

to IDENTIFICATION METHOD: blastn 

(D; OTHER INFORMATION: identicy 98 

region 5 . . 151 
id R14326 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 46.. 192 

IC) ID::nTIFICATION method: blastn 
(D) OTHER INFORMATION: identity 91 

region 129. .275 

id W55137 

est 

{i:c) FEATURE: 

(A) MAME/KEY: other 

(3) LOCATION: 57.. 192 

iC) IDEMTIFICATION METHOD: blastn 

(C) OTHER INFORr^TION: identity 91 

region 1 . . 136 
id W64115 
est 

{::-:; rZATU^Hl: 
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-- (A) NAME/KEY: other 
(3) LOCATION: 57., 192 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 91 

region 1. . 136 
id W75505 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 78.. 192 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 1. . 115 
id W20303 
est 



(ix) FEATURE: 

{A) S'AME/KEY: sig^peptide 

(B) LOCATION: 53.. 121 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.3 

seq AL3LLLVSGSLLP/G? 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 



ACTCAATAAA TGTTTTCCGC ATTAAGACGC TTCTTAGGAG TCTTCATGGA GG ATG TCG 58 

Met Ser 

GGT TCG TCG CTG CCC AGC GCC CTG GCC CTC TCG CTG TTG CTG GTC TCT 106 
Gly Ser Ser Leu Pro Ser Ala Leu Ala Leu Ser Leu Leu Leu Val Ser 
-20 -15 -10 

GGC TCC CTC CTC CCA GGG CCA GGC GCC GCT CAG PJKC GAG CCA AGG ATT 154 
Gly Ser Leu Leu Pro Gly Pro Gly Ala Ala Gin Asn Glu Pro krq lie 
-5 1 5 10 

GTC ACC AGT GAA GAG GTC ATT ATT CGA GAC AGC CCC GTG 193 
Val Thr Ser Glu Glu Val lie lie Aiq Adu Ser Pro Val 
15 20 



(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHAPACTERISTICS : 

(A) LENGTH: 247 base pairs 

■Q) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

;D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) 0?.IGI:iAL SOURCE: 

(.-.) ORGANISM: Homo Sapiens 
{?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) MAi-:E/KEV: other 
(3) LOCATION: 7i..207 
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- (C) IDENTIFICATION METHOD: biastn 
(D) OTHER IMFOR>LATICN: identity 100 

region 1 . .137 
id R73005 
est 

•:ix) FEATURE: 

■A) NAME/KEY: other 
{3) LOCATION: 80.. 207 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 128 
id N26942 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 86. ,207 

(C) IDENTIFICATION .METHOD: blastn 

(D) OTHER INFORRATION: identity 100 

region 1 . . 122 
id WU2954 
est 

(ix) FEATURE: 

(A) NAi4E/KEY: other 

{B) LOCATION: 112.. 207 

CO IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1..96 
id T24907 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 137.. 207 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 71 
id PA130938 
est 

(ix) FEATURE: 

(Al NAME /KEY: sig_pep-icie 
(3) LOCATION: 53. .223 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORM.ATION: score 7.1 

seq VGL.AVVSLGGSRG/SG 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 



AACTG.ACAAG ACGTGGGCCA AGAGGGGTCA CZGCCCCOZG AGCGGCGCGN AS ATG ATG Sr3 

Met Met 

GAA GTC GTA GTA GGA AAT GGC GTC- GTG GCA TTG AGG GGC ATC OCT OCT I 05 
Glu Val Val Val Gly .Asn Gly Val Val Ala Leu Arg Giy He Pro Pro 
-SS -=.0 -45 -4C 

.AGA .ACC TCC AGG .'-J-JK AGC TCG CGG .-^-^.G ACG AGG TTC TGC GGA GAG .AGA 154 
Arg Thr Ser ?-.r:: i-v-: Sor'Ser Arg Lys Thr Arg Phe Cys Gly Giu Arc 
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-35 -30 -25 

GGC TCC PAG CAG TCT GGG AAG TGT AGT CCA GTT GGC TTA GCA GTA GT? 202 

Giy Ser Lys Gin Ser Gly Lys Cvs Ser Pro Vai Gly Lsu Ala Vai Val 
-20 " -15 -10 

TCG TTG GGG G3G AGC CGA GGT TCC GGG AAG GGG CTA GGC CGA CTG 24 7 

Ser Leu Glv Giv Ser Arg Gly Ser Giy Lys Gly Leu Gly Arq Leu 
-5 1 5 



(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SECUENC2 CHARACTERISTICS: 

(A) LENGTH: 373 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3 : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

{vi} ORIGINAL SOURCE: 

(A) ORGANISM: Honio Sapiens ^ 
(?) TISSUE TYPE: Testis JT' 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 252. .375 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 93 

region 1 . . 124 
id AA081350 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 313.. 375 

(C) IDENTIFICATION METHOD: blastn 
(DJ OTHER INFORMATION: identity 93 

region 1 . . 58 
id PAOASeil 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 200.. 24 7 

•:C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.7 

sec CFSLVLLLTSIWT/TR 

(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 



AATTTTTCCC CCAGTGACCT TGAC.^^GTCA GAAGCTTGi^A A3CAGGGA-AA TCCGGATGTC 60 

TCGGTTATGA AGTGGAGCAG TGAGTGTGAG CCTCAACATA GTTCCAGAAC TCTCCATCCG 120 

GACTAGTTAT TC-.-.::CATCTG CCTCTCATAT CACC.:\GTGGC CATCTGAGGT GTTTCCCTGG 180 

CTCTG.iAGGG v^^T.-. -■•CACr, ATG GCC Av" TGC TTC AGC CIT. GT3 TTG CTT CTC 232 
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Met Ala Arg Cys Phe Ser Leu Vai Leu Leu I-au 
-15 -10 

ACT TCC ATC TGG ACC ACG AGG C?C CTG GTC CAA GGC TCT TTG CGT GCA 230 
7hr Ser lie Trp Thr Thr Arg Leu Leu Vai Gin Giy Ser Leu Arg Ala 
-5 * i 5 10 

GAA GAG CTT TCC ATC CAG GTG TCA TGC AGA ATK ATG GGG ATC ACC CTT 328 
Giu Glu Leu Ser lie Gin Vai Ser Cys Arg Xaa Met Giy lie Thr Leu 
15 20 25 

GT3 AGO AAA AAG GCG AAC CAG CAG CTG AAT TTC ACA GAA GCT AAG 373 
Vai Ser Lys Lys Ala Asn Gin Gin Leu Asn ?he Thr Giu Ala Lys 
30 35 40 



(2) IMFOE^ATION cOR SEQ ID NO: 211: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 38 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spieen 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 149.. 355 

(C) IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identity 100 

region 1 . .207 
id R16604 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 
(3) LOCATION: 354.. 407 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity LOO 

region 20" . . 260 

id ?.:6604 

est 



(ix) FEATURE: 

(A) NAME/KEY: other 
■:2) LOCATION: 14 9.. 352 
•:C; IDENTIFICATION METHOD: blastn 
OTHER INFORMATION: identity 98 

region . . . 2 1h 
id N99550 
est 



{!:<» ri.ATURE: 

;A) h}?l':t/KZ'{ : otr.er 
'b' LCCAtrOM: 33 C. 129 
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(CI IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identity 93 

region 237. .28S 

id N99558 

est 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 31.. 93 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.6 

seq CLSCLLIPLALWS/II 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

GAAATCTCCC GCAGTTCTAA GCAGGGCAAA ATG GGG TCT CGG AAG TGT GGA GGC 54 

Met Giy Ser Arg Lys Cys Gly Gly 
-20 -15 

TGC CTA ACT TGT TTG CTG ATT CCG CTT GCA CTT TGG AGT ATA ATC GTG 102 
Cys Leu Ser Cys Leu Leu lie Pro Leu Ala Leu Trp Ser He He Val 
-10 -5 

AAC ATA TTA TTG TAT TTC CCG AAT GGG CAA ACT TCC TAT GCA TCC AGC 150 
Asn lie Lea Leu Tyr Phe Pro Asn Gly Gin Thr Ser Tyr Ala Ser Ser 
5 10 15 



AAT AAA CTC ACC AAC TAC GTG TGG TAT TTT GAA GGA ATC TGT TTC TCA 
Asn Lys Leu Thr Asn Tyr Val Trp Tyr Phe Glu Gly He Cys Phe Ser 
20 25 30 33 



GGA TAC TGC CTG GTC ATC TCT GCC TTG GGT CTT GTC CPA GGG CCA TAT 
Gly Tyr Cys Leu Val lie Ser Ala Leu Gly Leu val Gin Gly Pro Tyr 
85 90 95 



198 



GGC ATC ATG ATG CTT ATA GTA ACA ACA GTT CTT CTG GTA CTG GAG AAT 246 
Gly lie Met Met Leu lie Val Thr Thr Val Leu Leu Val Leu Giu Asn 
40 45 50 

AAT AAC AAC TAT AAA TGT TGC CAG AGT GAA AAC TGC AGC AAA AAA TAT 294 
Asn Asn Asn Tyr Lys Cys Cys Gin Ser Glu Asn Cys Ser Lys Lys Tyr 
55 50 65 

GTG ACA CTG CTG TCA ATT ATC TTT TCT TCC CTC GGA ATT GCT TTT TCT 342 
Vai Thr Leu Leu Ser lie lie Phe Ser Ser Leu Gly He Ala Phe Ser 
70 75 80 



390 



TGC CGC ACC CTT GAT GGC TGG GAG TAT GCT TTT GAA GGC ACT GCT GGA 4 33 
Cys Arg Thr Leu Asp Gly Trp Glu Tyr Ala Phe Giu Gly Thr Ala Giy 
100 105 HO 



(2: INFORMATION FOR SEQ ID NO: 212: 

{H SEQUENCE CHARACTERISTICS: 

:A) LENGTH: 378 base pairs 
;B) TYPE: NUCLEIC AClb 

STRANDEDNESS : DOUBLE 
:D; TOPOLOGY: LINEAR 
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(ii) MOLECULE TYPE: CDNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TY?E: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 251.. 376 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1. . 126 
id R16604 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 251.. 376 

CO IDENTIFICATION METHOD: blastn 
(D) OTHER INFORMATION: identity 96 

region 1 . .126 
id N99558 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 1337.195 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.6 

seq CLSCLLIPLALWS/II 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 



ATTTGTTCTC C.-AACAGTAA ACCAGTATTT CACACTGAGA TTGTCGGCTG CGGGTATATT 60 

CCAATTCCCC GTCTCCTCAT GAATATGAAG TGAAGGGCTC TGAMCCTKGG A^GTGGTTCT 120 

PiP,GCAGGGC^ .-A .^.TG GGG TCT CGG i^AG TGT GGA GGC TGC CTA AGT TGT TTG 171 
Met Gly Ser Arg Lys Cys Gly Gly Cys Leu Ser Cys Leu 
-20 -15 -10 

CTG ATT CCG CTT GCA CTT TGG AGT ATA ATC GTG .AAC ATA TTA TTG TAT 219 
Leu He Pro Leu Aia Leu Trp Ser lie lie Vai Asn He Leu Leu Tyr 
-5 1 5 

TTC CCG A.-.T GGG CAA ACT TCC TAT GCA TCC AGC AAT AAA CTC ACC A^C 267 
Phe Pro Asn Gly Gin Thr Ser Tyr Ala Ser Ser Aiiii Lys Leu Thr Asn 
10 15 20 

TAC GTG T3C TAT TTT GAA GGA ATC TGT TTC TCA GGC ATC ATG ATG CTT 315 
Tyr Val rr- Tvr Phe Giu Gly He Cys Phe Ser Gly lie Met Met Leu 
25 ' 30 35 ^0 

ATA GTA A:.-\ PvCA GTT CTT CTG GTA CTG GAG .AAT AA.T AAC AAC TAT ;^--A 363 
Tie Va" Thr Thr Val Leu Leu Val Leu Glu Asn Asn Asn Asn Tyr Lys 
45 50 55 



TGT TGC C.-.i- GGG 
Cys Cys Z.r Scr Gly 



3-'8 
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(2) INFORMATION FOR SEQ ID NO: 213: 

(i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 230 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 
{D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CONA 

(vi) ORIGINAL SOURCE: 

(A) ORGAtlISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

{A) NAME/KEY: Other 

(B) LOCATION: 53.. 227 

vC) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 99 

region 190. .364 

id AA043641 ^i. 
est »■ 



(tx) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 92.. 227 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 99 

region 1 . . 136 
id N98697 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 69.. 102 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 94 

region 393. . 426 
id'AA147010 



(i>:) FE.^TUR£: 

(A) NAME/KEY: other 

(B) LOCATION: 53.. 119 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 435. .501 
id AA142534 
est 



A) NAME/KEY: s:Lg_peptide 
5) LOCATION: 153.. 209 

C) IDENTIFICATION METHOD: Von Heijne matrix 

D) OTHER INFORMATION: score 5.3 

seq ILFCVSrVFLTHC/TI 
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(xi I, SEQUENCE DESCRIPTION: SEQ ID NO: 213: 



AAACTTTA3C ACATCATTTT GGCAT7CTAG AATATTTCAT CTACCATACT ATAATTACC7 

GAZsGACAirA GGAGAAT ACA AACTTGCAGG TGTTTTTCTT GGAGGTCGTT CAATGGGC7C 

AAGAGCAGCT GCTTCTGTAA TGTGTCACAT TGAGCCAG ATG ATG GTG ATG ATT TTG 

Met Met Vai Met lie Leu 
-15 

7TC GGG GTC TCA TTT GTA TTT CTT ACC CAC TGC ACC ATC CAA AGC AGC 
Phe Gly Vai Ser Phe Vai Phe Leu Thr His Cys Thr lie Gin Ser Ser 
-10 -5 1 5 

TGC GGG 
Cys Gly 



(2) INFORMATIOf^ FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(11) MOLECOLE TYPE: CDNA 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 310.. Jy3 
{CI IDENTIFICATION METHOD: biastn 
(D) OTHER INFORMATION: identity 93 

region 1 . . 34 
id HUM4 26A07B 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(31 LOCATION: 293.. 349 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.4 

seq VLVSLPHPHPALT/CC 

(xi) SEQUENCE DESCRIPTION:: SEQ ID NO: 214: 



AAACCTTGTT GCTAGGGACC GGGCGGTTTG CGGCAACCG7 GGGC.'VCTGCT GA.=iTTTGAAT 
TGAGGGGCGA GGGAAAAGTT TTCCTCAGGT GTGGTGGGGA GAGGGAGGCG GATGCCGGNG 

AAACCGTAGG r'J^.CGCGGTCA GAAAGGCGAC GCGCTCTCGG ACTTGGAAAG GGACGCCTGG 
TTTCCCrCC;-. .--.^CG-AACCGG G.=vTGGGA.-.GT GACTTC.V.TG AGATTGAACT TCAGCTGG.=v? 
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TGAAAGAGAG' GCTAGAAGTT CCGCTTGCCA 3CAGCCTCCT TAGTAGAGCG GA ATG AGT 293 

Met Ser 



AAT ACC CAC ACG GTG CTT GTC TCA C7T CCC CAT CCG CAC CCG GCG CTC 34 6 
Asn Thr His Thr Vai Leu Vai Ser Leu ?rc His Pro His Pro Ala Leu 
-15 -in -5 

ACC TGC TGT CAC CTC GGC CWC CCA CAC CCG GTC CGC GCT CCC CCC CCG 394 
Thr Cys Cys His Leu Gly Xaa Pro His Pro Vai Arg Ala Pre Arg Pro 
1 5 iC 15 



(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDMA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 111.. 321 

(C) IDENTIFICATION METHOD: biascn 

(D) OTHER INFORMATION: identity 90 

region i . .211 
id N41784 
est 

ii-A) FEATURE: 

(A) NAME/KEY: othar 
(3) LOCATION: 143.. 237 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 5 . . 99 
Id T7Cii5 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 99.. 41*6 

(C) IDENTIFICATION METHOD: Vcn Heijna matrix 

(D) OTHER INFORMATION: score 4.1 

seq IITLACVPMTSFT/RN 

ixi) SEQUENCE DESCRIPTION: SEQ ID SO; 215: 



ACAGC.-.rCCT TTTCAAGGAT ADCTG.AACAG .AACCTTCT.AA. GTCTCAGACA CGTAAACCCA 60 

AGTGT:-GC.-J\ AGGAACTCAT TGCTCTCG.AA ATGCATAT ATG TKG GTT TAT ACA CTC 116 

Met Xaa Vai Tyr Arg Leu 
-ICS 
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CAA ACT GAR AAG CCC ?JKC ACT ACT GTK CAJ^ GIT CCA GCC TTT CTT 154 

Gin Thr Gin Glu Lys Pro Asn Thr Tnr Val Gin Val Pro Ala ?he Leu 
-IOC -95 -90 -85 

Ci\Pi GAG CTG GTA GAT CGG GAT AAT TCC AAA TTT GAG GAG TGG TGT ATT 212 
Gin Glu Leu Vai Asp Arg Asp Asn Ser Lys Phe Glu Glu Trp Cys lie 
-80 -75 -70 

GPiPi ATG GCT GAG ATG CGT AAS AAA GTG TGG ATA AA.G GAA AAG CAA AAC 260 
Glu Met Aia Giu Met Arg Xaa Lys Vai Trp He Lys Glu Lys Gin Asn 
-65 ' -60 -55 

ACG AAG AGG TTA AGG AGC TGT ACC AAA GGT TAC CTG CTG GAG CTG AGC 308 
Tar Lys Arg Leu Arg Ser Cys Thr Lys Gly Tyr Leu Leu Glti Leu Ser 
-50 -45 * -40 

CCT ATG ACT TTG TGT CTG TGG AAT GGC TGG AAA AGT GGT TGG ATG AAT 356 
Pro Met Ser Lau Ser Leu Trp Asn Gly Cys Lys Ser Gly Trp Met Asn 
-35 -30 -25 

CAG CAA NTA CCA AAC CTA TTG ATA ATC ACG CTT GCC TGT GTT CCC ATG 4 04 
Gin Gin Xaa Pro Asn Leu Leu He lie Thr Leu Aia Cys Vai Pro Met 
-20 -15 -IC -5 

ACA AGC TTC ACC GGG AAT AAA ATA TCA ATT ATG AAG AGG ATA TCT GAA 4 52 
Thr Ser Phe Thr Arg Asn Lys lie Ser He Met Lys Arg lie Ser Giu 
i 5 10 

TAT GCA GCK GAC ATT TTC TAT 473 
Tyr Aia Aia Asp He Phe Tyr 
15 



(2) IMrORMATION FOR SEQ ID NO: 216: 

;i) sequence: characteristics: 

(A) LENGTH: 134 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANOEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: riuiiio Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

{A) NAME/KEY: other 

(B) LOCATION: 63.. 133 

(C) IDEMTIFICATIOM METHOD: biascn 

(D) OTHER INFORMATION: la^n-ity 97 

reg.on 132. .222 
id AJ^.D4 397 4 
est 

;i:-:) FEATURE: 

(A) NAME/KEY: ocner 

•: = ; LOCATION: 93.. 133 

'Z) IDENTIFICATION METHOD: blas-.,n 
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(D) OTHER IMFOS^.TION: identify iOO 

region 1 . . 35 
ic WC55C1 
est 



{-ixl FEATURE: 

{A) NAME /KEY: sig_peptide 

(B) LOCATION: 54.. 116 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

seq LIAVVIIILLIFT/SV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 



AGACTTTGCT GATTTAGCTT ATGGAAGAGG AACCAGAAAT TTGTCCT7GA ATA ATG 56 

Met 



TTT CCC GTG TTG GGC TGG ATC TTG ATA GCA GTW GTY ATC ATC ATT CTT 
Phe Pro Vai Leu Gly Trp lie Leu lie Ala Vai Val lie lie He Leu 
-20 -15 -10 -5 



104 



CTG ATT TTT ACA TCT GTC ACC CGA TGC CTG 
Leu He Phe Thr Ser Val 7hr Arg Cys Leu 
1 5 



134 



(2) INFORMATION FOR SEQ ID NO: 217: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 



(11) KOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FE.ATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 153. .199 

(C) IDENTIFICATION METHOD: blastn 
CO) OTHER INFORMATION: identity 97 

region 1 . . 47 
id R14297 
est 



(i:<) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 8 . . 64 

(C) IDENTIFICATION METHOD: Von Keijne natrix 

(D) OTHER INFORMATION: score 3.9 

seq SVCLCPCLNKGQ5/EN 

SEQUENCE DESCRIPTION: SEQ ID UO: 217: 
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ATG TTC TCC TGC TGT ATC TCA GTT TGT CTA TGT CCT TOT CTC 4? 
Me- Phe Ser Cys Cys lie Ser Vai Cys Leu Cys Pro Cys Leu 
-15 -10 

AAC AA.A GGC CAA AG? GAG AAT CTT TCC AGA GAC TGC GGW CAT TGG CTG 97 
AsPi Lys Giy Gin Ser Glu Asn Leu Ser Arg Asp Cys Giy His Trp Leu 
-5 1 5 10 

AAC CCT CAC CAT CGA CGC CTC TGG CCA TTT GGC AGA AGG CAC CCA CAG 14 5 
Asn Pro His His Arg Arg Leu Trp Pro Phe Giy Arg Arg His Pro Gin 
15 20 25 

GAT TGT GGA CTC TTC CAA GAT TCA CAA TGR TAT GGT GAA TCC A.AA GAC 193 
Asp Cys Giy Leu Phe Gin Asp Ser Gin Xaa Tyr Giy Giu Ser Lys Asp 
30 35 40 

TGG AAC GGG 202 
Trp Asn Giy 
45 



(2) IMFORM^TION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4C6 base pairs 

{3) TYPE: NUCLEIC ACID 

(C) STRANDEONESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Uterus 

iiK) FEATURE: 

(A) NAME/KEY: other 
13) LOCATION: 333.. 403 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: idencicy 98 

region 33. . 1C3 
id W73795 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 333.. 403 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFOR^MATICN: identity 93 

region AOS, .470 
id .AA151030 
esc 

(i::) FEATURE: 

(A) NAME/KEY: other 
(=) LOCATION: 333.. 403 
(C) IDENTIFICATION METHOD: biastn 
'j) OTHER IIJFORt'lATICN: identity 93 

region 6. .71 
icj K466-:0 
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est 

(IX) FEATURE: 

(A) NAME/KEY: other 

(3) 2CCATI0N: 338.. 403 

(CI IDSMTirrCATION METHOD: biastn 

(O: OTHER INFORMATION: identity 93 

region 73. . 138 
id R99176 
est 

(ix) FEATURE: 

(A) SAME/KEY: other 

(3) LOCATION: 338.. 403 

(C) IDENTIFICATION METHOD: biastn 

(0) OTHER INFORMATION: identity 93 

region 43. .113 
id W79571 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig_pe?tide 

(B) LOCATION: 1437.229 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LTYLLLLSPIKYP/LD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 



ATCGTATTGG CACAGTTCTC TATGTAAGCA ATTTGAGAGG GAAGCAAAGG GGAAAAGTTT 60 

G.AGTTAGCTG TTCTCTGTCC TAGAATTTCC CTGCATTAAT CTTGTCCTTG AAAATATATA 120 

T.J^ATACTGGT CCCTT.'^AACT CC .ATG AGG CTT TGT CTC ATT ATG TAT TGT TCT 172 

Met Arg Leu Cys Leu lie Met Tyr Cys Ser 
-25 -20 

TTT GGT ACC CTT TCC CAC TTA ACT TAC CTT TTG CTC CTA AGT CCT ATA 220 
Phe Gly Thr Leu Ser His Lau Thr Tyr Leu Leu Leu Leu Ser Pro lie 
-15 -10 -5 

AnA TAC CCC TTG GAT CTG GAT TTT TTA TAC CCG ATT TTC TCC ACT GTG 268 
Lvs Tyr Pro Leu Asp Leu Asp Phe Leu Tyr Pro lie Phe Ser Thr Val 

5 10 

TAT AAA AGG TAT ATT GTG ACT GTA AAT TTT TGT ATA TCA TGT TCT GAG 316 
Tyr Lys Arg Tyr lie Val Thr Val Asn Phe Cys lie Ser Cys Ser Glu 
15 20 25 

AGC TTC TTA CTT TCT GAT CTC ATA GCA CTA TTC CTG ATC AGA GAA CTC 364 
5er Phe Leu Leu Ser Asp Leu lie Ala Leu Phe Leu lie Arg Glu Leu 
30 3*5 4C 45 

Z:\G TTG CTT C.=Jk CAC ACA GTA TCA GTA GTG CAG CCA CCC ACG 406 
Gin Leu Leu Gin His Thr Val Ser Val Vai Gin Pro Pro Thr 
50 55 



::i :;.! FORMAT 10?: for seq id NO: 219: 
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(i) SETQ'JENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 
(3) TYPE: ^4UCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLEC'JLE TYPE: CDMA 



(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(XK) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: compiement { 118 . . 206) 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 14 4.. 232 
id T77881 
est 



iiK) FEATURE: 

(A) NAME/KEY: other ^ 

(B) LOCATION: complement ( 64 1 18 ) i"*' 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 98 

region 231. .285 
id T77881 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: complement ( 126 .. 206) 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 100 

region 139. .219 
id R01713 
est 



tix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 70., 14*7 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 10.5 

seq LLLALLLPVQVSS/FV 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NC: 219: 



.3vACCAGCCAG GAGCCACCCA TCCTCCAGCA CACTGAGCA3 C.^GCTGGAC ACACGGCACA 60 

CTG.ATCCAA ATG GGT AA.G GGG ATG GTG GCG ATG CTC ATT CT6 GGT CTG CTA 111 
Met Gly Lys Gly Met Val Ala Met Leii lie Leu Giy Leu Leu 
-25 ' * -20 -15 

CTT CTG GCG CTG CTC CTA CCC GTG CAG GTT TCT TCA TTT GTT CCT TTA 15 9 
Leu Ala Leu Leu Leu Pro Val G.n Val £er Scr Ph3 Val Pro Leu 
-10 -5 1 

agt atc ccg caa gct act gca gcc dw a-c aca aag ccc tcc aat 2 
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Thr'ser Hez Pro Glu Ala Thr Ala Ala Glu Thr Thr Lys Pro Ser Asn 

5 " 10 15 20 

GGG 210 
Gly 



(2) INFORMATION FOR 3EQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 189 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 2. .170 

- (C) IDENTIFICATION METHOD: blastn 
(D) OTHER INFORMATION: identity 98 

region 4 . . 172 
id H56777 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 25. .87 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.5 

seq LLVLr^LLANVQG/PG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 



AAGAAAGGCT GGCCTCTCTT CAAC ATG GGA TCT TCT GGA CTT TTG AGC CTC 51 

Met Gly Ser Ser Glv Leu Leu Ser Leu 
-20 -15 



CTG GTG CTA TTC GTC CTC TTA GCG AAT GTC CAG GGA CCT GGT CTC ACT 
Leu Val Leu Phe Val Leu Leu Ala Asa Val Gin Gly Pro Gly Leu Thr 
-10 -5 1 



99 



GAT TGG TTA TTT CCC AGG AGA TGT CCC AAA ATC AGA GPA GAA TGT GAA 147 

Aso Trp Leu Phe Pro Arg Arg Cys Pro Lys lie Arg Giu Glu Cys Glu 
5 10 20 

TTC CAA GAA AGG GAT GTG TGT ACA AAG GAC AGA CAA TGC CGA 139 

Phe Gin Glu Arg Asp Val Cys Thr Lys Asp Arg Gin Cys Arg 

25 30 



^ 2) INFORMATION FOR SEQ ID NO: 221: 
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( i ) - SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 323 base pairs 

(B) TY?R: MUCLEIC ACID 

( C ) STRANDEDN'ESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 52.. 25 3 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 95 

region 10 . .216 
id R60167 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 235.. 319 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 98 

region 194 . , 
id R60167 
est 



278 



(ixj FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 143.. 258 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 92 

region 90. .205 
Ld R17888 
est 

(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 55.. 145 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFOR>LATION: identity 98 

region 1 . . 91 
id R:7388 
est 



(ix) FE.ATURE: 

(A) NA.ME/KEY: other 

(B) LOCATION: 235. .315 

(C) IDENTIFICATION METHOD: biastn 
{D) OTHER INFOPi-LATION: identity 99 

region 18 3 . < 
id P.17888 
est 



264 



(ix-. FEATURE: 

:A) NAiME/KEY: c::her 
:3) LOCATION: i;i..253 

:z) ide::tificaticm :-:ethod: oiastn 
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(D) OTHER INTORMATIOK: identity 91 

region 85 . . 202 
id N40052 
est 



(ix) FEATURE': 

(A; NAME/KEY: Other 

(B) LOCATION: 55. . 144 

(Ci IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 1 . . 89 
id N40052 
est 



(ix) FEATURE: 

(A) NAt-dE/KSY: other 

(B) LOCATION: 235.. 319 ■ 

(C) IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identity 97 

region 130. .264 
id N40052 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 58.. 257 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 1. .200 
id AA039912 
est 



(ix) FEATURE; 

(A) N.AME/KEY: other 

(B) LOCATION: 248.. 319 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 190. .261 
id .AA039912 
est 



(ix) FE.ATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 50.. 319 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: iaentity 97 

region 1 . . 270 
id R54127 



(ix) FEATURE: 

(A) N.^VME/KEY: sig_peptide 
(5) LOCATION: 90.. 194 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
{:>) OTHER INFORMATION: score 8.1 

seq NLLLLHCV3RSHS/QN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



iji^.GGCC TGACCACGCA GTTCTTGGGT CTGT3CTGCT GGCCTGGGGT TGTGGTTG.-.G 



6C 
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GCCGTGTCTC CGCTCCTGT3 CCCGGGAAG ATG GTG CTA GG7 GGT TGC CCG GT7 113 

Mftt Val Leu Giy Gly Cys FrD Vai 
-35 -30 

AGT TAG TTA CTT CTG TGC GGC CAG GCG GCT TTG CTG CTG GGG AAT TTA 161 
Ser Tyr Leu Leu Leu Cys Gly Gin Ala Aia Leu Leu Leu Gly Asn Leu 
-25 -20 -15 

CTT CTG CTG CAT TGT GTG TCT CGG AGC CAC TCG CAA A.AT GCG ACC GCT 209 
Leu Leu Leu His Cys Val Ser Arg Ser His Ser Gin Asn Ala Thr Ala 
-10 -5 1 5 

GAG CCT GAG CTC ACA TCC GCT GGC GCC CCC AGC CGG AGG GCC CCG GGG 25"? 
Giu Pro Giu Leu Thr Ser Aia Giy Aia Pro Ser Arg Arg Ala Pro Gly 
10 15 20 

GTG CTG CGA GCT GGG AAT ATG GCG ACC CCC ACT CTC CGG TCA TCC TCT 305 
Vai Leu Arg Aia Gly Asn Met Ala Thr Pro Thr Leu Arg Ser Ser Ser 
25 30 35 

GCT CTT ACC TAG CTT GGG 323 
Aia Leu Thr Tyr Leu Giy 
40 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQCENCE CHARACTERISTICS: 

(A) LENGTH: 165 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

<ii) MOLECULE TYPg: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(i:<) FEATURE: 

(A) NAME/KEY: Other 
(3) LOCATION: 31.. 143 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: idenLiuy 99 

region 271 . . 383 

id W16767 

est 

(i:<) FEATURE: 

(A) NAME/ KEY: sig_pepcice 
(3) LOCATION: 34.. 87 

(C) IDENTIFICATION METHOD: Von Heijne ma-ri:< 
iO) OTHER INFORM.ATION: score 8.3 

seq LLSLSSLPLVLLG/WE 

«j<i) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 
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Met GIu Thr Gly Arg Leu Leu 
-15 

AGC CTC AGC TCT CTT CCT CTT GTT CTC C7A GGG TGG 3AG TAG AGC AGC iC2 

Ser Leu Ser Ser Leu Pro Leu Val Leu Leu Gly Trp Giu Tyr Ser Ser 

-10 * -5 i 5 

CAA ACG CTG AAC TTA GTC CCA TCC ACT TCC ATC TTA TCC TTT GTG CCC 150 
Gin T^ir Leu Asn Leu Vai Pro Ser Thr Ser He Leu Ser Phe Val Pre 
10 13 20 

TTC ATC CCC CGA GTG 
Phe He Pro Arg Vai 
25 



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
CD TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 24.. 203 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 1 . . 180 
id HSCIPFOSI 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 114.. 203 

(C) IDENTIFICATIOM METHOD: blastn 

(D) OTHER INFOE^JVTION: identity 98 

re:^io.T 7 3. . 1 67 
id H03709 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 35.. 107 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORT^-ATION: identity 98 

region 1 . . 73 
id H037C9 
est 

(ix) FE.ATURE: 

(A) NAME/KEY: si-j_peptide 
•3; LOCATION: 25.. 81 
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iC) IDENTIFICATION METHOD: Von Keijne niacrix 
(D) OTHER INrORMATIOM: score 8.2 

seq QVLALVLVAALWG/GT 



(xi) SEQUENCE DESCRIPTIOM: SEQ ID NO: 223: 



AAAGTAGAAG ACAGCGGCGT TGCC ATG GC3 GCG TCT CTG GGG CAG GTG TTG 51 

Met Ala Ala 3er Leu Gly Gin Val Leu 
-15 

GCT CTG GTG CTG GTG GCC GCT CTG TGG GGT GGC ACG CAG CCG CTG CTG 99 
Ala Leu Val Leu Val Ala Ala Leu Trp Gly Gly Thr Gin Pro Leu Leu 
-10 -5 1 5 

AAG CGG GCC TCC GCC GGC CTG CAG CGG GTT CAT GAG CCG ACC TGG GCC 147 
Lys Arg Ala Ser Ala Gly Leu Gin Arg Val His Glu Pro Thr Tr? Ala 
10 15 20 

CAG CAG TTG CTA CAG GAG ATG AAG ACC CTC TTC TTG AAT ACT GAG TAC 195 
Gin Gin Leu Leu Gin Glu Met Lys Thr Leu Phe Leu Asn Thr Glu Tyr 
25 3D 35 

CTG ATG 201 
Leu Met 
40 



(2) INFORMATION FOR SEQ 10 NO: 224: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDE0NE3S: DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HoRO Sapiens 
(D TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 6.. 119 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 92 

region i . .114 
id N83684 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(S) LOCATION: 147.. 24 1 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: icentity 97 

region \\2 . . P.Ofi 

Id N83634 

esc 



wo 99/06549 PCT/1B98/0t23t 

175 

(ix). .FEATURE: 

(A) NAME/KEY: other ' 

(B) LOCATION: 283.. 323 

(C) IDENTIFICATIOM METHOD: biastn 

(D) OTHER INFORMATION: identity IOC 

region 245 . . 285 
id N83684 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

{B) LOCATION: 327.. 361 

(C) IDENTIFICATION METHOD: biastn 

;D) OTHER INFORMATION: identity 97 

region 287 , . 321 

id N83684 

est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 150.. 299 

(C) IDENTIFICATION METHOD: biastn 
•D) OTHER INFORMATION: identity 93 

region 177 . . 326 

id H94179 . 
est ? 



(IX) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 23. .109 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 90 

region 41 . . 127 
id H94179 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 
(3) LOCATION: 17.. 90 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 74 
Id .2^A,o?3069 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 144.. 194 

(C) IDENTIFICATION METHOD: biastn 
tO) OTHER INFORMATION: identity 90 

region 123. . 173 

id'AA093069 

est 



(i:<) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 150.. 2.49 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER IMFORM.i\T:ON: iae.-^.tity 97 

regior. 193 . . 297 
\r. Tn7110 
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{Li) MOL£CUL£ TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Homo Sapiens 
(F; tissue TYPE: Oteru?? 

iix) FEATURE: 

(A) N.AME/KSY: other 
(3) LOCATION: 251.. 376 

(C) IDENTinCATIOM METHOD: biastn 

(D) OTHER INFORMATION: identity 100 

region 1. . 126 
id R16604 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 251.. 376 

(C) IDENTiriCATIOM METHOD: biastn 

(D) OTHER INFORMATION: identity 96 

region 1 . . 126 
id N99558 
est 

(ix) FE.ATURE: * 
{A) NAME/KEY: sig^peptide 
(3) LOCATION: 1337.195 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.6 

seq CLSCLLIPLALWS/II 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 



.ATTTGTTCTC C.-.AACAGTAA ACCAGTATTT CACACTGAGA TTGTCGGCTG CGGGTATATT 60 

CCA.ATTCCCC GTCTCCTCAT GAATATGAAG TGAAGGGCTC TGATtCCTKGG A.AGTGGTTCT 120 

AAGCAGGGCA .-.A .ATG GGG TCT CGG AA5 TGT 3GA GGC T:^C CTA ACT TGT TTG 171 
Met Gly Ser Arg Lys Cys Gly Gly Cys Leu Ser Cys Leu 
-20 -15 -10 

CTG ATT CCG CTT GCA CTT TGG .AGT ATA k" GTG AAC ATA TTA TTG TAT 219 
Leu lie Pro Leu Ala Leu Trp Ser lie lie Val Asn He Leu Leu Tyr 
-5 1 5 

TTC CCG A.AT GGG CAA ACT TCC TAT GCA TCC AGC AAT AAA CTC ACC A.AC 267 
Phe Pro Asn Gly Gin Thr Ser Tyr Ala Ser Ser Aba Lys Leu Thr Asn 
10 15 20 

TAC GTG TGC TAT TTT GAA GGA ATC TGT TTC TCA GGC ATC ATG ATG CTT 3L5 
Tyr Val Tip Tvr Phe Giu Gly He Cys Phe 3er Giy lie Met Mec Leu 
25 30 35 '^0 

ATA GTA ATA ATA GTT CTT CTG GTA CTG GAG .AAT AP.T AJVC AAC TAT AAA 363 
lie Val Thr Thr Val Leu Leu Val Leu Glu Asn Asn Asn Asn Tyr Lys 
45 50 55 

TGT TGC CA*. .-.aT GGG ^''Q 
Cys Cys 3.:' Scr Gly 
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(C) IDENTinCATIDN METHOD: blastn 

(D) -OTHER INFORMATIOt^: identity 93 

region 237, .26S 
id N39558 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 31.. 93 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 5.6 

seq CLSCLLIPLALWS/II 

{xil SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

GAAATCTCCC GCAGTTCTAA GCAGGGCAAA ATG GGG TCT CGG AAG TGT GGA GGC 54 

Met Gly Ser Arg Lys Cys Gly Gly 
-20 -15 

TGC CTA AGT TGT TTG CTG ATT CCG CTT GCA CTT TGG AGT ATA ATC GTG L02 
Cys Leu Ser Cys Leu Leu lie Pro Leu Ala Leu Trp Ser lie He Val 
-10 -5 1 

AAC ATA TTA TTG TAT TTC CCG AAT GGG CAA ACT TCC TAT GCA TCC AGC 150 
Asn He Leu Leu Tyr Phe Pro Asn Gly Gin Thr Ser Tyr Ala Ser Ser 
5 10 13 

AAT AAA CTC ACC AAC TAC GTG TGG TAT TTT GAA GGA ATC TGT TTC TCA 198 
Asn Lys Leu Thr Asn Tyr Val Trp Tyr Phe Glu Gly He Cys Phe Ser 
20 25 30 35 

GGC ATC ATG ATG CTT ATA GTA ACA ACA GTT CTT CTG GTA CTG GAG AAT 246 
Gly lie Met Met Leu He Val Thr Thr Val Leu Leu Val Leu Glu Asn 
40 45 50 

AAT AAC AAC TAT AAA TGT TGC CAG AGT GAA AAC TGC AGC AAA AAA TA'l' 294 
Asn Asn Asn Tyr Lys Cys Cys Gin Ser Glu Asn Cys Ser Lys Lys Tyr 
55 60 65 

GTG ACA CTG CTG TCA ATT ATC TTT TCT TCC CTC GGA ATT GCT TTT TCT '342 
Vai Thr Leu Leu Ser lie He Phe Ser Ser Leu Gly He Ala Phe Ser 
70 75 80 

GGA TAC TGC CTG GTC ATC TCT GCC TTG GGT CTT GTC CAA GGG CCA TAT 390 
Gly Tyr Cys Lau Vai lie Ser Ala Leu Gly Leu Val Gin Gly Pro Tyr 
85 90 95 

TGC CGC ACC CTT GAT GGC TGG GAG TAT GCT TTT GAA GGC ACT GCT GGA 4 33 
Cys Arg Thr Leu Asp Gly Trp Glu Tyr Ala Phe Glu Gly Thr Ala Gly 

too 105 110 115 



(2: INF0RK.^.TI::M for seq id NO: 212: 

(i; SEQUENCE CHARACTERISTICS: 

:A) LENGTH: 378 base pairs 
;3} TYPE: NUCLEIC ACID 
:-J) STRANDEDNESS: DOUBLE 
:j) TOPOLOGY: LINEAR 
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Met Ala Arg Cys Phe Ser Lea Val Leu Leu leu 
-15 -10 

ACT TCC ATC TGG ACC ACG AGG C7C CTG GTC CAA GGC TCT TTG CGT GCA 230 
7hr Ser lie Trp Thr Thr Arg Leu Leu Vai Gin Giy Ser Leu Arg Aia 
-5 1 5 10 



GA.=i GAG CTT TCC ATC CAG GTG TCA TGC AGA ATK A7G GGG ATC ACC CTT 
Glu Glu Leu Ser lie Gin Val Ser Cys Arg Xaa Met Giy lie Thr Leu 
15 20 25 



323 



GT3 AGC AAA AAG GCG AAC CAG CAG CTG AAT TTC AC A GAA GCT AAG 
Val Ser Lys Lys Ala Asn Gin Gin Leu Asn Phe Thr Glu Ala Lys 
30 35 40 



373 



[2) IMFORKATION FOR SEQ ID NO: 211: 



ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 38 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3: DOUBLE 

(D) TOPOLOGY; LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 14 9.. 355 

(C) IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identirry 100 

region I . . 207 
id P.16604 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(3) LOCATION: 354.. 407 

{C> IDENTIFICATION METHOD: blascn 

(C) OTHER INFORMATION: identify 100 

recion 20" . . 260 
id R:6604 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 

:2) LOCATION: 149.. 362 

•:C; IDEMTIFICATION METHOD: oiastn 

(0) OTHER INFORMA.T ION: icencicy 98 

regicr. . . .214 
id N99550 
esc 



(i:'.! FEATURE: 

:;•.) na:-:S/ksy: 

'b' LCCATIOM: 



ot.^.er 
33C. .129 
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-35 -30 



'20 



GGC TCC AAG GAG TCT GGG AAG TGT AGT CCA G7T GGC TTA GCA GTA GT7 202 

Giy Ser Lys Gin Ser Giy Lys Cvs Ser ?ro Vai Giy leu Ala Vai Vai 

-20 " -15 -IC 

TCG TTG GGG G3G AGC CGA GGT TCC GGG P-^G GGG CTA GGC CGA CTG 24 7 

Ser Leu Giy Giy Ser Arg Giy Ser Giy Lys Giy Leu GLy Arg Leu 
-5 I 5 



(2) INFOEWATION FOR SCQ ID NO: 210: 

(i) SECUENC2 CHARACTERISTICS: 

(A) LENGTH: 373 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Honto Sapiens 
{:) TISSUE TYPE: Testis 

(ix) EE^.TURE: 

(A) NAME/KEY: other 
(3) LOCATION: 252.-375 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 1 . . 124 
id AA081350 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 313.. 375 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION; identity 93 

region 1 . . 58 
id AA046671 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 200T.247 

•C) IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 6.7 

sec CrSLVLLLTSIWT/TR 

(:<i) St:'JSNCS DESCRIPTION: 3EQ ID MO: 210: 



AATTTTTCCC CC.i.GTGACCT TGAC^^SiGTCA GAAGCT7GAA AGCAGGGAAA TCCGGATGTC 60 

TCGGTTATGA AC-TC-GAGCAC TGAGTGTGAC CCTCAACATA GTTCCAGAAC TCTCCATCCG 120 

GACTAGTTAT TGACCATCTG CCTCTCATAT CACCAGTGGC CATCTGAGGT GTTTCCCTGG IflO 

CTCTGi-SGC^G GT. AC", ATG GZC ^^.Cr. TGC T7C -.GC C7?. GTG TTG CTT CTC 232 
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(C) IDENTIrlCATION METHOD: biastn 
'D) OTHER IMFCRMATICN: identity 100 

recion 1 . .137 

id'R73005 

est 

•lix) FEATURE: 

iA) NAME/KEY: other 
(3) LOCATION: 90. .207 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 128 
id N26942 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 86. .207 

(C) IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identity 100 

region 1 . . 122 
Id WU2954 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 112.. 207 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 96 
id T24 907 
est 

(ix) FEATURE: 

(A) NAME/KEY: Other 

(3) LOCATION: 137.. 207 

(C) IDENTIFICATION METKCD: blastn 

(0) OTHER INFORKATION: identity 100 

region 1 . . 71 

id AjA130938 

est 

(ix) FEATURE: 

(A) NA.ME/KEY: sig^peptide 
(3) LOCATION: 53.. 223 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7,1 

seq VGL.AVVSLGGSRG/SG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 



AACTGACAAG ACGTGGGCCA .AGAGGGGTCA CCGCCCCCGG AGCGGCGCGK AS ATG ATG 5.3 

Met Met 

GAA GTC GTA GTA GGA AAT GGC GTC G7G GCA TTG AGG GGC ATC CCT OCT 106 
Glu Vai Vai Val Gly Asn Gly Val Val Ala Leu Arg Giy lie Pro Pro 
-SfS - = 0 -45 -4C 

AGA ACC TCC AGG '-J-JK AGC TCG C3G A-^.G ACZ AGG TTC TCC GGA GAG AGA IS-i 
.Arg T.^.r Ser Ar:? Lvn Scr Ser Arg Lys Thr Arg Phe Cys Giy Glu Arc 
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SEQUENCE DESCRIPTION: SEQ ID NO: 207: 



AAGGGGGTCC AAAGTGCTCA GCCCCCGGGG CACAGCAGGA CGTTTGGGGG CCTTC7TTCA 50 

GCAGG3GACA GCCCGATTGG GGACA ATG GCG TC? CTT GGC CAC ATC T7G GTT 112 

Met Ala Ser Leu 3iy His lie Leu Vai 

-20 -15 

TTC TGT GTG GGT CTC C?C ACC ATG GCC AAG GCA GAA AGT CCA AAG G^JK 150 

Phe Cys Val Gly Lau Leu Thr Met Ala Lys Ala Giu Ser Pro Lys Glu 

-10 -5 1 5 

CAC GAC CCG AGG I'^Z 
His Asp Pro Arg 



(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A; LENGTH: 193 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) STRANDEDNESS: DOUBLE 
(DJ TOPOLOGY: LINEAR 

ill) MOLECULE TYPt;: CDNA 

(vi) ORIGINAL SOURCE: 

(AJ ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A; NAME/KEY: other 

(3J LOCATION: 4S..192 

(C) IDENTIFICATION METHOD: biastn 

(D} OTHER INFORMATION: identity 98 

region 5 . . 151 
id R14326 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

O) LOCATION: 46.. 192 

(C) IDENTiriCATION METHOD: blastn 

(Oi OTHER INFORMATION: identity 91 

region 129. .275 
id W55137 
est 

(i;0 FE.ATURE: 

(A) MAME/KEY: other 

(3) LOCATION: 57.. 192 

iC) IDEMTIFICATION METHOD: biastn 

(C) OTHER INF0R:^TI0N: identity 91 

region 1 , . 136 
id W54115 
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(A) NAM£/K£Y: other 
(3) LOCATION: 57.. 192 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 91 

region 1. . 136 
id W75505 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 78.. 192 

[O IDENTIFICATION METHOD: blastn 
(D) OTHER INFORMATION: identity 97 

region 1. . 115 
id W20303 
est 



(ix) FEATURE: 

:A) NAi^JE/KEY: sig^peptide 

(B) LOCATION: 53.Tl21 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.3 

seq AL3LLLVSGSLLP/G? 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 



ACTCAATAAA TGTTTTCCGC ATTAAGACGC TTCTTAGGAG TCTTCATGCiA bG ATG TCG 58 

Met Ser 

GGT TCG TCG CTG CCC AGC GCC CTG GCC CTC TCG CTG TTG CTG GTC TCT 106 
Gly Ser Ser Leu Pro Ser Ala Leu Aia Leu Ser Leu Leu Leu Val Ser 
-20 -15 -10 

GGC TCC CTC CTC CCA GGG CCA GGC GCC GCT C.AG AAC GAG CCA .AGG ATT 154 
Gly Ser Leu Leu Pro Gly Pro Gly Aia Ala Gin Asn Giu Pro Arg lie 
-5 1 5 10 

GTC ACC AGT G.AA GAG GTC ATT ATT CGA GAC AGC CCC GTG 193 
Val Thr Ser Glu Glu Val lie He At^ Aou 3er Pro Val 
15 2D 



(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 base pairs 

;D) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

;D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CONA 

(vi) OP.IGIiiAL SOURCE: 

[k) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) featurh:: 

(A) MAXE/KEV: Other 
(3^ LOCATION: 7L..207 
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6G 



(2) INrORMATION rOR SEQ ID NO: 213: 

(i; SEQ'JENCE CHARACTERISTICS: 

(A) LENGTH: 230 base pairs 

(B) TYPE: MUCLEIC ACID 

(C) STRANDEDNcSS: DOUBLE 
{D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATIOM: 53.. 227 

:C) IDENTIFICATION METHOD: 



biastn 



•;D) OTHER INFORMATION: 



identity 99 
region 190. , 
id AA043641 
est 



364 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 92.. 227 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 99 

region 1. . 136 
id N98697 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
IB) LOCATION: 69.. 102 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFOE^TICN: identity 94 

region 393. .426 
id AA147010 



(i.v:) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 53.. 119 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 435. .501 
id AA142334 
est 



(A) NAME/ KEY: sig^peptide 
;5) LOCATION: 155.. 209 

(C) IOENT:fiCAT:om method: Vcn Keijne matrix 
:0\ other INFORMATION: score 5.3 

seq ILFGVSFVFLTHC/TI 
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(xi:. 3EQUEKCE DESCRIPTIOM : SE2 ID NO: 213: 

AAAC7TTAGC ACATCA7TTT GGCATTCTAG AATA7TTCAT CTACCATACT A7AATTACC7 6D 

GAAGACA7r:A GGAGAA7ACA AACT7GCAGG TG7T77TC77 GGAGG7CG77 CAA7GGGC7C 120 

AAGAGGAGC7 GC7TC7G7AA 7GTGTCACA7 7GAGCCAG A7G ATG G7G A7G A77 TTG 176 

Met Met Vai Met lie :.eu 
-15 

770 GGG GTC 7CA 777 G7A 777 C77 ACC GAG 7GC ACC A7C CAA AGC AGC 224 
Phe Gly Val Ser ?he Vai Phe Leu 7hr His Cys 7hr lie Gin Ser Ser 
-10 -5 1 5 

7GC GGG 
Cys Gly 



{2) INF0RMA7I0N FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERIS7:CS: 

(A) LENG7H: 394 base pairs 

(3) 7Y?E: NUCLEIC ACID 

(C) S7RANDEDNESS : DOUBLE 

(01 TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hoaio Sapiens 
(f) TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: JlU..3yj 

(CI IDEN7IFICA7I0N ME7HCD: biastn 

(D) 07HER INF0RMA7I0N: identity 93 

region 1 . . 34 

id HUM426A07B 

est 

(IX) FEA7URE: 

(A) NAME/KEY: siq^peptide 
(3) LOCATION: 2937.349 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) 07KER INFORMA7I0N: score 4.4 

seq VLVSLPHPHPALT/CC 

(xi) SEQUENCE DESCRIPTIO:: : SEQ ID NO: 214: 

AAACCTTGTT GCTAGGGACC GGGCGGT77G CGGCAACCG7 GGGCAC7GCT GA.ATTTGAAT 60 

TGAGGGGCGA GGGAAAAGTT TTCC7CAGGT GTGGTGGGGA GAGGGAGGCG GATGCCGGNG 120 

AAACCG7AGG KACGCGG7CA GAAAGGCGAC CCCC7CTCGG AG77GGAAAG GGACGCCTGG 130 

777CCCrCC;. .".aCCAACCGG GA7GG:-.=w-.G7 GAC77C.=i;i.TG AGA77GAAC7 7CAGC7GGA7 24 0 
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TGAAAGAGAG- aCTAGAAGTT CCGCTTGCCA 3CAGCCTCCT TAGTAGAGCG 3A ATG AGT 233 

Me- Ser 



AAT ACC CAC ACG GTG CTT GTC TCA C7T CCC CAT CCG CAC CCG GCC CTC 34 6 

Asn Thr His Thr Val Leu Val Ser Leu ?rc His Pro His Pro Ala Leu 

-IS -13 -5 

ACC TGC T3T CAC CTC GGC CWC CCA CAC CCG GTC CGC GCT CCC CGC CCG 394 

Thr Cys Cys His Leu Gly Xaa Pro His Pro Val Arg Ala Pre Arg Pro 

1 5 IC 13 



(2) :NFCR->iATION FOR SEQ ID NO: 215: 

(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 4 73 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: COMA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 111.. 321 

vC) IDENTIFICATION METHOD: blascn 

(0) OTHER INFORMATION: identity 90 

region i . .211 
id N41784 
est 

(ix) FEATURE: 

(A) NAME/KEY: ot.her 
(3) LOCATION: 14 3.. 237 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 9*^ 

region 5. . 99 
Id T7Cii5 
est 

(ix) fcE-ATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 99.. 4 16 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: score 4.1 

seq IITLACVPMTSFT/RN 

ixi) SEQUENCE DESCRIPTION: SEQ ID SO: 213: 



ACAGCATCCT TTTCAAGGAT ADCTG.AACAG AACCTTCTAA GTCTCAGACA CGTAAACCCA 60 

AGTGT-33C.=A AGGAACTCAT TGCTCTCG.AA ATGCATAT ATG TKG GTT TAT ACA CTC 116 

Met Xaa Val X/r Arg Leu 
-ICS 
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C?A ACT GAR AAG CCC PAC ACT ACT GTK CA.A GTT CCA GCC TTT CTT 164 

Gin Thr Gin Giu Lys Pro Asn Thr Tnr Val Gin Val Pro Ala ?he Leu 
-IOC -^5 -^0 -85 

CAA GAG CTG GTA GAT CGG GAT AAT TCC AAA TTT GAG GAG TGG TGT ATT 212 
Gin Giu Leu Val Asp Arg Asp Asn Ser Lys Phe Glu Glu Trp Cys lie 
-80 -75 -70 

G,AA A-G GCT GAG ATG CGT AAS AAA GTG TGG ATA AA.G GAA AAG CAA AAC 250 
Glu Met Ala Giu Met Arg Xaa Lys Val Trp He Lys Glu Lys 31a Asn 
-55 -60 -55 

ACG AAG AGG TTA AGG AGC TGT ACC AAA GGT TAC CTG CTG GAG CTG AGC 308 
Thr Lys Arq Leu Arg Ser Cys Thr Lys Giy Tyr Leu Leu Glu Leu Ser 
-50 -45 -40 

CC-^ ATG AGT TTG TCT CTC TGG AAT GGC TGC AAA AGT GGT TGG ATG AAT 356 
Pro Met Ser Leu Ser Leu Trp Asn Gly Cys Lys Ser Giy Trp Met Asn 
-35 -30 -25 

CAG CAA NTA CCA AAC CTA TTG ATA ATC ACG CTT GCC TGT GTT CCC ATG 404 
Gin Gin Xaa ?ro Asn Leu Leu lie lie Thr Leu Ala Cys Val Pro Met 
-20 -IS -IC -5 

ACA AGC ^TC ACC CGG AAT A.^A ATA TCA ATT ATG AAG AGG ATA TCT GAA 4 52 
Thr Ser Phe Tnr Arg Asn Lys He Ser He Met Lys Arg He Ser Glu 
1 5 ^0 



TAT GCA GCK GAC ATT TTC TAT 
Tyr Ala Ala Asp He Phe Tyr 
15 



473 



(2) IMTORMATION* TOR SEQ ID NO: 216: 

;i) SEQ'JHINCE: CHARACTEKISTICS: 

(A) LENGTH: 134 base pairs 
■2) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

lii) MOLECULE TYPE: CONA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

ti:<) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 63,. 133 

(C; IDEMTIiriCATIOM HZTHOD: biascn 
11) OTHER IMrORMATION: :aen-ity 97 

region L52. .222 
id A/-.0'J397 4 
est 

ii:-:) THI-.TURE: 

(r.) NAME/KEY: otner 
[l. LOCATION: 93.. 133 

IDENTinCATION METHOC*: cidsm 
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(D) OTHER IMFORMATION: iden^iiry 100 

region 1 . . 35 
ic WC55C1 
esc 

(ix) TEATURE:: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 54.. 116 

(C) IDENTIFICATION METHOD: Vcn Hei^ne matrix 
(C) OTHER INFORMATION: score 3.9 

seq LI.AWIIILLIFT/3V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 



AGACTTTCCT GATTTAGCTT ATGGAAGAGG AACCAGAAAT TTGTCCTTGA ATA ATG 56 

Met 

TTT CCC GTG TTG GGC TGG ATC TTG ATA GCA GTW GTY ATC ATC ATT CTT 104 
Phe Pro Vai Leu Gly Trp He Leu He Ala Vai Val He lie He Leu 
"20 -15 -10 -5 



CTG ATT TTT ACA TCT GTC ACC CGA T3C 
Leu He Phe Thr Ser Val ?hr Arg Cys 
i 5 



CTG 134 
Leu 



(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 base pairs 

(3) TYPE: NOCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(Dl TOPOLOGY: LINEAR 

(ii) KOLECULE TYPE: CDNA 

(VL) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapie.-s 
(F) TISSUE TYPE: Testis 

(ix) FEATURE; 

(A) NAME/KEY: other 
(8) LOCATION: 153.. 199 
(C) IDENTIFICATION METHOD: bUstn 
(0) OTHER INFORMATION: i3erti:y 97 

regi on 1 . . 4 7 
id R1420"' 
est 

(i:<) FEATURE: 

(Al NAME /KEY: sig_peptide 
(3) LOCATION: 8.. 64 

;C) IDENTIFICATION METHCD: Von K9i:ne naCrix. 
(0) OTHER INFORMATION: score 3.9 

seq SVCLC?CLNKCQ$/EN 

SEQUENCE DESCRIPTION: SEQ ID ::0: 217 : 
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aagca.j;g ATG TTC TCC TGC ?GT ATC TCA GTT TGT CTA TGT CCT T3r CTC 4 9 

Mar Pna Ser Cvs Cys lie Ser Vai Cya Leu Cys Pro Cvs Leu 
-15 -10 



AAC AAA GGC CAA AG? GAG AAT CTT TCC AGA GAG TGC GGW CAT TG"; CTG 97 

Asn Lys Giy Gin Ser Glu Asn Leu Ser Arg Asp Cys Giy His Tro Leu 

-5 1 5 n 

AAC CCT CAC CAT CGA C3C CTC TGG CCA TTT GGC AGA AGG CAC CCA CAG 14 5 

Asr. Pro His His Arg Arg Leu Trp Fro ?he Giy Arq Arg His Pro Gin 

15 20 25 

GAT TGT GGA CTC TTC CAA GAT TCA CAA TGR TAT GGT GAA TCC A^iA GAC 193 

Asp Cys GLy Leu Phs Gin Asp Ser Gin Xaa Tyr Giy Giu Ser Lys Asp 

30 35 40 

TGG AAC GGG 202 
Trp Asn Giy 
45 



(2) IMFORKOiTION FOR SoQ ID NO: 218: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEONESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

;A) ORGANISM: Hono Sapiens 
(D TISSUE TYPE: Uterus 



(ixi FEATURE: 

:A) NAiME/KEY: other 

:3) LOCATION: 333.. 403 

fC) IDENTIFICATION METHOD: biastin 

(D) OTHER INFORMATION: identify 58 

reqior. '33. .103 
id W73795 
es - 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 333. .403 

(C) IDENTIFICATION METnOD: biastn 

(D) OTHER INFORilATICN: identity 93 

region 405 . .470 
id .Vil5:030 
est 



(i::) Fi-'XTURE: 

(A) NAME /KEY: other 

iS) L0CAT:0N: 333.. 403 

(C) IDENTIFICATION METHOD: blastn 

'j; OTHER IMFORt'IATION: identity 93 

region 6 . . 7 1 
id'K43^4.0 
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est 



(ix) FEATURE; 

(A) MAME/KEY: Olher 

(3) 20CATI0N: 333.. 403 

(CI IDE^JTinCATION METHOD: blastn 

(O: OTHER INFORMATION: identity 93 

region 73 . . 138 
id R99176 
est 



(ix) FEATURE: 

(A) SAME/KEY: other 
(3) LOCATION: 338.. 403 

(C) IDENTIFICATION METHOD: bla'stn 

(D) OTHER INFORMATION: identity 93 

region 48 . . 113 
id W79571 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig_pe?tide 

(B) LOCATION: 1437.229 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LTYLLLLSPIKYP/LD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 



ATCGTATTGG CACAGTTCTC TATGTAAGCA ATTTGAGAGG GAAGCAAAGG GGAAAAGTTT 60 

GAGTTAGCTG TTCTCTGTCC TAGAATTTCC CTGC.ATTAAT CTTGTCCTTG A.AAATATATA 120 

TAATACTGGT CCCTT.AAACT CC ATG AGG CTT TGT CTC ATT ATG TAT TGT TCT 172 

Met Arg Leu Cys Leu lie Met Tyr Cys Ser 
-25 -20 

TTT GGT ACC CTT TCC CAC TTA ACT TAC CTT TTG CTC CTA AGT CCT ATA 220 
o^.e Gly Thr Leu Ser His Leu Thr Tyr Leu Leu Leu Leu Ser Pro lie 
-15 -10 -5 

A.^ TAC CCC TTG GAT CTG GAT TTT TTA TAC CCG ATT TTC TCC ACT GTG 268 
Lvs Tyr Pro Leu Asp Leu Asp Phe Leu Tyr Pro lie Phe Ser Thr Val 
1 5 10 

TAT AAA AGG TAT ATT GTG ACT GTA AAT TTT TGT ATA TCA TGT TCT GAG 316 
Tvr Lys Arg Tyr lie Val Thr Val Asn Phe Cys He Ser Cys Ser Glu 
15 20 25 

A3C TTC TTA CTT TCT GAT CTC .ATA GCA CTA TTC CTG ATC AGA GAA CTC 364 
Ser Phe Leu Leu Ser A3D Leu He Ala Leu Phe Leu He Arg Glu Leu 
30 35 40 45 

Z.\Q TTG CTT C.=wA CAC ACA GTA TCA GTA GTG CAG CCA CCC ACG 4 06 

nn Leu Leu Gin His Thr Val Ser Val Vai Gin Pro Pro Thr 
50 S5 



:i IMFORMJVTIOr. FOR SZQ ID MO: 219: 



wo 99/06549 



168 



PCT/IB98/0I23I 



(i) 3EQ-JENCE CHARACTERISTICS: 

(A) LENGTH: 210 base paizs 
(3) TYPE: MUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 



(lil MGLEC'JLE TYPE: CDNA 



(VI) CP.IGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: compieraent (118. .206) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: icienrity 97 

regior^ 144 . . 232 
id 777881 
est 



(ix) FEATURE: 

(A) N.AME/KEY: other ^• 

(B) LOCATION: complement : 64 118 ) i* 

(C) IDENTIFICATION METHOD: blastn 
CD) OTHER INFORMATION: identity 98 

region 231. .285 
id T77881 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: complement (126 206) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: idsn-ity 100 

region 139. .219 
.d R01713 
est 



(,LX) FEATURE: 

(A) NAME/KEY: sigjjeptide 

(B) LOCATION: 70.. 14 7 

(C) IDENTIFICATION METHDC: Von Heijne matrix 

(D) OTHER INFORMATION: score 10.5 

seq LLLALLLPVQVSS/FV 

(Xi) SEQUENCE DESCRIPTION: SZQ ID tIC: 219: 



AACCAGCCAG GAGCCACCCA TCCTCCAGCA CACTGA3CA3 C.AAGCTGGAC AC.=^CGGCACA 60 

CTGATCCAA ATG GGT AAG GGG ATG GTG GC3 ATG CTC ATT CTG GGT CTG CTA 

Met Giy Lvs Glv Met Vai Ala Met Leu lie Leu Giy Leu Leu 
-25 * -iO 



r-rr CTG GCG CTG CTC CTA CCC GTG CAG GTT TCT TCA TTT GTT CCT TTA 
Ala Leu Leu Leu Pro Val Glr. Val £er 3cr Ph9 Val Pro Leu 
-10 -5 i 



GCT .'.CT CCA GCC CrS\ .\ZC ACA AAG CCC 



111 
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Thr Ser Met Pro Glu Ala Thr Ala Ala 31u Thr Thr Lys Pro Ser Asn 
5 10 15 20 

GGG 210 
Gly 



(2) INTORMATICN FOR 3EQ ID NO: 220: 

(1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 189 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 2..17C 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 4 . . 172 
id H56777 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 25.787 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.5 

seq LLVLr^LLANVQG/PG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 



AAGAAAGGCT GGCCTCTCTT CAAC ATG GGA TCT TCT GGA CTT TTG AGC CTC 51 

Met Gly Ser Ser Giv Leu Leu Ser Leu 
-20 ' -15 

CTG GTG CTA TTC GTC CTC TTA GCG AAT GTC CAG GGA CCT OCT CTC ACT 99 
Lsu Val Leu Phe Val Leu Leu Ala Asn Val Gin Gly Pro Gly Leu Thr 
-10 -5 1 

GAT TGG TTA TTT CCC AGG AGA TGT CCC AAA ATC AGA GP-A GAA TGT GAA 14 7 
Aso Trp Lea Phe Pro Arg Arg Cys Pro Lys lie Arg Giu Glu Cys Glu 
5 10 15 20 

TTC CAA GAA AGG GAT GTG TGT ACA AAG GAC AGA CAA TGC CGA 139 
Phe Gin Glu Arg Asp Val Cys Thr Lys Asp kzq Gin Cys Arg 
25 30 



'2) INFORMATION FOR SEQ ID NO: 221: 
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{D- SEQUENCE CHARACTERISTICS : 

(A: LENGTH: 323 base pairs 
(3) TY?r,: MUCLEIC ACID 

(C) 3TRANDEDNESS : DOUBLE 

(D) TQPOLOGY: LINEAR 

(ii) KOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hcrao Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 52.. 253 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 95 

region 10 . .216 
id R60167 
esz 

iix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 235.. 319 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 98 

region 194 . .278 
id R60167 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 143.. 253 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: ider^ti-y 92 

region 90. .205 

id'R17S88 

esc 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 55.. 145 

(C) IDENTIFICATION iMETHCj; oi35tn 

(D) OTHER INFORMATION: l.-:*-: l:/ 98 

r^.zizr. .91 
id .-.:7838 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 235.. 315 

(C) IDENTIFICATION MSTHOD . oiascn 
iO) OTHER INFORMATION: idr.-.-iry 93 

ra'Tior. 183. .26 
id'r.:7133 



fixi rE.=wTURE: 

•A) N.A^ME/KEY: Other 
:3) LOCATION: i;i..253 

:z) ide::ti?icatic:: :':et.kod: c.ascn 
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(0) OTHER INFORMATION: identity 91 

region 85. ,202 
id N40052 
est 



(ix) "EATaRE"; 

(A) NAME/KEY: Other 
(3) LOCATION: 56.. 144 

[C] IDENTIFICATION METHOD: blastn 

(D) OT.KER INFORMATION; identity 97 

region 1 . . 99 
id N40052 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) . LOCATION: 235.. 319 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 130. .264 
id N40052 
est 



(ix) FEATURE: . 

(A) NAME/KEY: Other 
(3) LOCATION: 58.. 257 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 1..200 
id AA039912 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 248.. 319 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 190. .261 
id .2A039912 



(ixl FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 50.. 319 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 1 . . 270 
id R54127 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(5) L(XATION: 90.. 194 

(C) IDENTIFICATION METHOD: Von Heijrs matrix 
(H) OTHER INFORMATION: score 8.3 

seq KLLLLHCV3RSHS/QN 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



ZQ^ZCC TGACCACGCA GTTCTTGGG7 CTGTGCTGCT GGCCTGGGGT TCTGGTTGAG 6C 
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GCCG^GTCTC CGCTCCTGT3 CCCGGGPAG ATG 3TG CTA GG? GGT TGC CCG GTT i:3 

Mftr Val Leu Gly G.y Cys PrD va* 
-35 -30 

3T Ti^C TTA CTT CTG TGC GGC CA3 GCG GCT TTG CTG CTG GGG AAT TTA 161 
er ^yr Leu Leu Leu Cys Giy Gin Ala Ala Leu Leu Leu G.y Asn ueu 

-po -15 



Se 

-25 -20 

CTT CTG CTG CAT 7GT GTG TCT CGG AGC CAC TCG CA.A P-AT GCG ACC GCT 

L*u Leu Leu His Cys Val Ser Arq Ser His Ser Gin Asn A^a Thr Ala 

-10 -5 1 ^ 

GAG CCT GAG CTC ACA TCC GCT GGC GCC CCC AGC CGG AGG GCC CCG GGG 

Glu Pro Giu Leu Thr Ser Ala Gly Ala Pro Ser Arg Arg «la Pro Gl^ 



10 



15 20 



GCT CTT ACC TAC CTT GGG 
Ala Leu Thr Tyr Leu Gly 
43 



239 



25-' 



GTG CTG CGA CCT GGG AAT ATG GCG ACC CCC ACT CTC CGG TCA TCC TCT 305 
vll Leu Arg Ala Gly Asn Met Aia Thr Pro Thr Leu Arg Ser Ser Ser 
25 30 35 



323 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SIQCENCS CHARACTERISTICS: 

(A) LENGTH: 165 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(Li) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



{i:0 FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 31.. 143 

(C) IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identicy 99 

region 271., 383 

id wieis"? 

est 



(i:<) FEATURE: 

(A) NAME/KEY: 5ig_pepcide 
(3) LOCATION: 34 . .87 

(C) IDENTIFICATION METHOD: Von Keijne .Tia-rix 
'D- OTHER INFORMATION: score 8.3 

sea LL3LSSLPLVLLG/WE 



SEQUENCE DESCRIPTION: SEQ ID HO: 222: 



^.ACTAGACTT TTG.ACCCCTT GGGCAACAGC CA3 ATG GAC 



CGC CI 
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Met GIu Thr Gly Arg Leu Leu 
-15 



ir- r-r AGC TCT CTT CCT CTT GTT CTC CTA GGG TGG 3AG TAG AGC AGC 102 
Ser L8U Sec Ser Leu Pro Leu Val Leu Leu Gly Trp G:u Tyr Ser 3er 
-10 -5 

CA.A ACG CTG A.:VC TTA GTC CCA TCC ACT TCC ATC TTA TCC T7T GTG CCC 150 
G^^^ Leu Asn Leu Val Pro Ser Thr Ser He leu Ser Phe Val Pre 
10 15 20 

165 

TTC ATC CCC CGA GTG 
Phe He Pro Arg Val 
25 



(2) INFORMATION FOR SEQ ID MO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) 3TRANDEDNESS : DOUBLE 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) OP.IGINAL SOURCE; 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(3) LOCATION: 24.. 203 

(C) IDENTIFICATION METHOD: blastn 

(0) OTHER INFORMATION: identity 97 

region 1 . . 180 
id HSClPFOSi 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(3) LOCATION: 114.. 203 

(C) IDENTIFICATION METHOD: blastn 

(Q) OTHER INFORMATION: identity 98 

region 73 . .167 
id H03709 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 35.. 107 

(C: IDENTIFICATION METHOD: blastn 

(01 OTHER INFORr>l=vTION: identity 98 

region 1 . . 73 
id H037C9 
est 



(iv) :E.arJRE: 

(A) NAME/KEY: si'j_pe?tide 
•:3: LCC^^TION: 25.-31 
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iC) IDENTIFICATION METHOD: Von Keijne siacrix 
(0) OTHER INFORMATION: score 8.2 

seq QVXJ!wLVLVAALWG/GT 

(xi) SEQUENCE DESCRIPTIOM : SEQ ID NO: 223: 



AP2VGTAGAAG ACAGCGGCGT TGCC ATG GCG GCG 7CT CTG GGG CAG GTG T ,o :>l 

Met Ala Ala Ser Leu Giy Gin Val Leu 
-15 

3CT CTG GTG CTG GTG GCC GCT CTG TGG GGT GGC ACG CAG CCG CTG CTG 99 
Ala Leu val Leu Val Ala Ala Leu Trp Gly Gly Thr Gin Pro Leu ueu 
-10 -5 1 5 

AAG CGG GCC TCC GCC GGC CTG CAG CGG GTT CAT GAG CCG ACC TGG GCC 14 7 
Lvs Arg Ala Ser Ala Gly Leu Gin Arg Val His Glu Pro Thr Tr? Ala 
10 15 20 

CAG CAG TTG CTA CAG GAG ATG AAG ACC CTC TTC TTG AAT ACT GAG TAC 195 
Gin Gin Leu Leu Gin Glu Met Lys Thr Leu Phe Leu Asn Thr Glu Tyr 
25 30 35 



CTG ATG 
Leu Met 
40 



201 



(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNE3S : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Horao Sapiens 
(D TISSUE TYPE: Spleen 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 6. . 110 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 92 

region 1 . . 114 
id N83684 
est 



(ix) FEATURE: 

(A) MAME/KEY: other 

(3) LOCATION: 147.. 241 

(CI IDENTIFICATION METHOD: blastn 

(D) OTHER INF0R21AT:0N: identity 97 

ic N83684 
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(ix). .FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 283.. 323 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity IOC 

reqiop. 245. .285 

id N83684 

est 



(ix) FEATURE: 

;A) KAME/KEY: other 

(B) LOCATION: 327. . 361 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 287 . . 321 
id N83684 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 150. .299 

(C) IDENTIFICATION METHOD: biastn 
{D) OTHER INFORMATION: identity 93 

region 177 . . 326 
id H94179 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 23. .109 

(C) IDENTIFICATION METHOD: biastn 
(0) OTHER INFORMATION: identity 90 

region 41. . 127 
id H94179 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 17. .90 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 100 

reqion 1 . .74 
Id .=1.^093069 



(ix) FEATURE: 

(A) NAME/ KEY: other 

(B) LOCATION: 144.. 194 

(C) IDENTIFICATION METHOD: ol^otn 

(D) OTHER INFORMATION: identicy 90 

recior. 123. .173 
iu A.2i.093069 



(i;0 FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 150.. 249 

(C) IDENTIFICATION METHOD- blastr. 

(D) OTHER IMc0?KAT:0N: i--er>.-ity 97 

re.^ic- 133.. 297 
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est 



(ix) FHIATURE: 

(A) NAME/KSY: sig_peptide 
(3) LOCATION: 211. .387 

(C) I0EM7IFICA7I0N METHOD: Von Keiine niatrix 
(0) OTHER INFORMATION: score 8.1 

seq rLLGISNLSQVRA/SN 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 



AGCGGGTGTT GAGAGCGGTG TGGTAGGTGT TGTAGCCGCT ATGGTGAAGT TCGCTTTGTA 60 

GCGGCCCCGG CTAGAGAGTT GGCCTGTTCC CTGCCTTTGT GACCCGGAGG AGCTTTTGGG 120 

GTGCGTCAAG CCCCTGGCCT GAGGCAGCGA DCTGGTTTGT GGCCTGTTTG ATTCCTGTCA 180 

GAGGTTTGCT GACCCAAGAC AGTATCGAAA ATG CAT ATT AAG TCA ATT ATT CTA 234 

Met His lie Lys Ser lie He Leu 
-55 

GAG GGA TTC AAG TCC TAT GCT CAG AGG ACC GAA GTC AAT GGT TTT GAC 282 
Glu Giy Phe Lys Ser Tyr Ala Gin Arg Thr Glu Val Asn Gly Phe Asp 
-50 -45 -40 

CCC CTC TTC AAT GCT ATC ACT GGC TTA AA7 GGT AGT GGG AAA TCC AAC 330 
Pro Leu Phe Asn Ala He Thr Giy Leu Asn Giy Ser Giy Lys Ser Asn 
-35 -30 -25 -20 

ATA TTG GAC TCC ATC TGC TTT TTG CTG GGC ATC TCC AAC CTG TCT CAG 378 
He Leu Asp Ser He Cys Phe Leu Leu Gly He Ser Asn Leu Ser Gin 
-15 -10 -5 

GTT CGG GCT TCT AAT TTA CAA GAT TTA GTT TAC AAA AAT GGG CAG GCT 426 
Val Arg Ala Ser Asn Leu Gin Asp Leu Val Tyr Lys Asn Giy Gin Ala 
1 5 10 

GGT ATT ACC AAA GCC TCT GTG TCA ATC AMT TTT GAT 4 62 

Giy lie Thr Lys Ala Ser Val Ser He Xaa Phe Ai>u 
15_ 20 25 



(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 base pairs 

(D) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) CP.IGINAL SOURCE: 

(A) ORGANISM: Hono Sapier.a 
(F) TISSUE TYPE: Testis 

{ix) FEATURE: 

{A) NAME/KEY: other 
:=) LOCATION: 280.. 404 
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{C) rOENTinCATION METHOD: biastr. 
•:D) other I M FORMAT ION: identity 97 

region 155 . . 289 

id N45465 

esz 

(ix) FEATURE: 

' (A) NAME/KEY: other 

(B) LOCATION: 76. ,168 

(C) IDENTIFICATION METHOD: blastn 
(D> OTHER INFORMATION: identity 98 

reqion 52 . .14 4 
id N46466 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(B> LOCATION: 405. -469 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 289. . 353 
id N46466 
est 



(ix) FEATURE: 

{k) NAME/KEY: other 

<B) LOCATION: complement ( 4 05 4 69 ) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 180. .244 
id H86648 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: conplement ( 34 3 . . 404 ) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 95 

region 244 . . 305 
id W86648 
est 

(ix) rE.=^TUR£: 

(A) NAME/KEY: other 

(B) LOCATION: complement { 297 34 7 ) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 302. .352 
id W86640 
est 

(ix) r-:ATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 273.. 358 

iC) IDENTIFICATION METHOD: blastn 
(D) OTHER INFORMATION: identity 96 

region 39 . .123 
id W36523 
est 



FEATURE: 
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(A) NAME/KEY: other 

(B) LOCATION: 357.. 4C4 

iC) IDENTTFTCATTCN METHOD: blastn 
(0) OTHER INFORMATION: identity 91 

region 123. . 170 
id W86523 
est 

(ix) FEATURE: 

(A) NAME /KEY: Other 

(B) LOCATION: 405.. 436 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 96 

region 170. .201 
id W86523 
est 

(ix) FEATURE: 

(A) NA^^E/KEY: sig peptide 

(B) LOCATION: 2857.341 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(D> OTHER INFORMATION: score 8 

seq WGFLCVLFTAVHP/A? 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 



AAGMSAGGGG AAGCGCCCAA GGTCACACAG CTGGGATGT3 
CCTGTTCCCA TTGCTGGACA GCTGCCACAT CTGGCACCCA 
GCCCAAGCCC CGGGGGTGGC GGGGGATCCT AGAGGAAAGT 
GCAGAGCCAG AGTAGAAAAC TGAGGCTCTG AGAGATGAAG 
GCACAGTCAC ATCCTACTGA ACATCATCCT GTTCTCTGGG 



GCAGAGCTGG GGTTCCAGCT 60 

ATTTAGGACC CCGCGGGGAG 120 

GGCAAGGCCA GGACCCTGGA 180 

CTACTTGCCA AGGTCACGCA 240 

TGGA ATG TCA CCA TCG 296 
Met Ser Pro Ser 



CCC AGG TGG GGA TTT TTG TGT GTT TTG TTC ACT 
Pro Arg Trp Giy Phe Leu Cys Val Leu Pfte Thr 
-15 . -10 -5 



GCT GTA CAC CCA GCC 
Ala val His Pro Ala 
1 



344 



CCC AGC ACA GCG CCT GTC CAG GAC AAG TGC CCA 
Pro Ser Thr Ala Pro Val Gin Asp Lys Cys Fro 
5 10 



GTA AAC ACT TGG GAA 
Val Asn Thr Trp Giu 
15 



392 



GCA ATG CAB VMG GTC CTC CCA GCA GCT CCT GCA 
Ala Met Xaa Xaa Val Leu Pro Ala Ala Pro Ala 
20 25 



AAC AGA CCC CCG ACC 
.Asn Arg Pro Pro Thr 
30 



440 



CAA GCC TTT CCT TCT GCM TCC ACT GCC ACA GGG 
Gin Ala Phe Pro Ser Ala Ser Thr Aia Thr Glv 
35 40 * 



473 



(25 INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 base pairs 
•S) TYPE: NUCLEIC ACID 
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(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEfiR 



(ii) MOLECULE TY?E: GDNA 

(vij ORIGINAL SOURCE : 

[A) ORGANISM: Homo Sapiens 
(F> TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) NAME/ECEY: other 

(B) LOCATION: complement (1 189) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 99 

region 1. .189 
id R47502 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 56.. 127 

(C) IDENTIFICATION METHOD: Von Heijne Mtrix 

(D) OTHER INFORMATION: score 7.8 

seq FLLCLCIAYWAST/AV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 



AATTCTCATC GCGATTGCAC TCATCAAAGA AGCCAGCAGG GCTGTGGGAT ACGTC ATG 53 

Met 

TGC TCC TTG CTC TAG CCA CTG GTC ACC TTC TTC TTG CTG TGC CTC TGC 106 
Cys Ser Leu Leu Tyr Pro Leu Val Thr Phe Phe Leu Leu Cys Leu Cys 
-20 -15 -10 

ATC GCC TAC TGG GCC AGC ACT GCT GTC TTC CTG TCC ACT TCC A.=vC GAA 154 
lie Ala Tyr Trp Ala Ser Thr Ala Val Phe Leu Ser Thr Ser Asa Glu 
-5 15 

GCG GTC TAT AAG ATC TTT GAT GAG AGC GCC TGG CCA TTT ACT GCG AAA 202 
Ala Val Tyr Lys lie Phe Asp Aso Ser Pro Cys Pro Phe Thr Ala Lys 
10 ' 15 * 20 25 

ACC TGG AAG CCA GAG ACC TTG GCC TCC TCC AAT GAG GCC CGC CAT GGG 2 50 
Thr Cys Asn Pro Glu Thr Phe Pro Ser Ser Asn Glu Pro Arg His Gly 
30 35 40 



(2) INFORMATION FOR SEQ ID NO: 227: 



(i) SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 176 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) OPr,AM:SM: Homo sapiens 
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(F) TISSUE TYPE: Spleen 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: complement ( 5 1 119) 

(C) IDENTiriCATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 404 . .472 
id AA099571 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: complement U 18 174 ) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 348. .404 
id AA099571 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 24.. 71 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.6 

seq FLFFSTLFSSIFT/EA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 



AAAAGCTTTG GAGATATTGA ATC ATG TTA CCA TTT CTG TTT TTT TCC ACC CTG 53 

Met Leu Pro Phe Leu Phe Phe Ser Thr Leu 
-15 -10 

TTT TCT TCC ATA TTT ACT GAA GCT CAG AAG CAG TAT TGG GTC TGC AAC 101 
ohe Ser Ser He Phe Thr Glu Ala Gin Lys Gin Tyr Trp Val Cys Asn 
" ' -5 1 5 10 

TCA TCC GAT GCA AGT ATT CAT ACA CCT ACT GTG ATA AAA TGC AAT ACC 149 
Se- Ser Asp Ala Ser He His Thr Pro Thr Val He Lys Cys Asn Thr 
15 20 . 25 

CAA TTT CAA TTA ATG TTA ACC CCT GGG ^'^^ 
Gin Phe Gin Leu Met Leu Thr Pro Gly 
30 35 



(2) INFORMATION FOR SEQ 10 NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Komo Sapiens 
ir) TTf^r^Ur. TYPE: Testis 



wo 99/06549 



181 



PCT/IB98/0t231 



(ix) FEATURE: 

(A) NAME/KEY: OTiher 

(3) LOCATION; 103.. 248 

(C) IDENTIFICATION METHOD: blasta 

{Dl OTKER INFORMATION: identity 97 

region 12.. 157 
id W56659 
est 



(ix) FEATURE: 

(A) NAME /KEY: other 
(3) LOCATION: 255.. 385 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 99 

region 164. .294 
id W56658 
est 



(ix) FE.ATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 18.. 248 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 1. .231 
id AA127477 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 123.. 38 5 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 1 . .263 
id N40410 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: complement { 3 4 0 37 1) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: icentity 93 

region 354 . . 385 
id R93185 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 126.. 167 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 7.5 

seq VALNLILVPCCAA/WC 

(xi) SEQUENCE DESCRIPTION: SZQ ID NO: 228: 



AATTGTATGT TACGATGTTG TATTGATTTT TAAGAPAGTA ATTKRATTTG TAAAACTTCT 60 
GCTCGTTT.^C ACTGCAC.ATT GAA7ACAGGT .^VCT.=JMTGG ImI^^GGAGAGGG GAGGTCACTC 120 
TTTTG ATG GTG GCC CTG AAC CTC ATT CTG GTT CCC TGC TGC GCT GCT TGG 170 
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Met Val Aia Leu Asn Leu He Leu Val Pro Cys Cys Ala Ala Trp 

-10 -5 1 

TG7 GAC CCA CGG AGG ATC CAC TCC CAG GAT GAC GTG CTC CG? AGO TCT 218 
Cys Asp Pro Arg Arg He His Ser Gin Asp Asp Val Leu Arg 5er Ser 
5 ■ 10 15 

GC? GCT GAT ACT GGG TCT GCG ATG CAG CGG CGT GAG GCC TGG GCT GGT 266 
Ala Aia Asp Thr Giy Ser Ala Met Gin Arq Arg Giu Aia Trp Aia Giy 
20 25 30 

TGG AGA AGG TCA CAA CCC TTC TCT GTT GGT CTG CCT TCT GCT GAA AGA 314 
Trp Arg Arg Ser Gin Pro Phe Ser Val Giy Leu Pro Ser Aia Glu Arg 
35 40 45 

CTC GAG AAC CAA CCA GGG AAG CTG TCC TGG AGG TCC CTG GTC GGA GAG 362 
Leu Glu Asn Gin Pro Giy Lys Leu Ser Trp Arg Ser Leu Val Giy Glu 
50 55 60 65 

GGA CAT AGA ATC TGT GAC CTC 383 
Giy His Arg He Cys Asp Leu 
70 



(2) INFORMATION FOR SEQ ID NO: 229: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 372 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEONESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Komo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 83.. 291 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 69. .277 
id AAi49265 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 12.. 57 

(C) IDENTIFICATICM METHOD: blastn 

(D) OTHER INFORMATION: identicy 95 

region 1 . . 46 
id .^149265 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 321.. 351 

(C) IDENTIFICATION METHOD: blastn 



wo 99/06549 



(D) OTHER INFORMATION: 
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identity 93 
region 310. .340 
id AA149265 
est 



(ixl FEATURE: 

{A) NAME /KEY: 
(B) LOCATION: 



Other 
81. .372 



(C) IDENTIFICATION METHOD: biastn 



{D) OTHER INFORMATION: 



identity 98 
region 53. . 344 
id W39570 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 27.. 57 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 93 

region 2 . . 32 
id W39570 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 81.. 372 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 55. .346 
id N41332 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 24.. 57 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 94 

region 1 . . 34 
id N41332 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 10.. 168 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.1 

seq lAVGLGVAALAFA/GR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 



AGCCTTGCC ATG GCT GCC CGT GGT GTC ATC GCT CCA GTT GGC GAG ACT YTG 51 
Met Ala Aia Arg Gly Vai He Aia Pro Vai Giy Glu Ser Leu 
-50 -45 -40 

CGC T.AC GCT GAG TAC TTG CAG CCC TCG GCC AAA CGG CCA GAC GCC GAC 99 
Arg Tyr Aia Glu Tyr Leu Gin Pro Ser Ala Lys Arg Pro Asp Aia Asp 
-35 -30 -25 

GTC GAC CAG CAG AGA CTG GTA AGA ACT TTG ATA GCT GTA GGA CTG GGT 14 7 
Vai Asp Gin Gin Arg Leu Vai Arg Ser Leu He Ala Vai Gly Leu Gly 
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-20 



-15 



■10 



GTT CCA OCT CTT GCA TTT GCA GGT CGC TAG GCA TTT CGG ATC TGG AAA 
Val Ala Ala Leu Ala Phe Ala Gly Arg Tyr Ala Phe Arg lie Trp Lys 
-5 . 1 5 



135 



CCT CTA GAA CPA GTT ATC ACA GAA ACT GCA AAG AAG ATT TCA ACT CCT 
Pro Leu Giu Gin Val He Thr Glu Thr Ala Lys Lys He Ser Thr Pro 
10 15 20 25 



243 



AGC TTT TCA TCC TAG TAT AAA GGA GGA TTT GAA CAG AAA ATG AGT AGG 
Ser Phe Ser Ser Tyr Tyr Lys Gly Gly Phe Glu Gin Lys Met Ser Arg 
30 35 4 0 



291 



CGA GAA GCT GGT CTT ATT TTA GGT GTA AGC CCA TCT GCT GGC AAG GCT 
Arg Glu Ala Gly Leu He Leu Gly Val Ser Pro Ser Ala Gly Lys Ala 
45 50 55 



339 



AAG ATT AGA ACA GCT CAT AGG AGA GTC ATG ATT 
Lys He Arg Thr Ala His Arg Arg Val Met He 
60 65 



372 



C2) INFORMATION FOR SEQ ID NO: 230: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 3.. 249 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 247 
id HUM225B05a 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 3.. 135 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 1 . . 133 
id HUM224A06B 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 131.. 183 

iZ) IDENTIFICATION METHOD: blastn 
;D) OTHER INFORMATION: identity 92 
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region 128 . . 130 
id H'JM224A06B 
est 

iix) FEATURE: 

(A) NAME/KEY: o-her 

(D) LOCATION: 182.. 223 

(LM IDENTIFICATION METHOD: blastn 

(D) OTHER INrORMATION: identity 95 

region 178. .219 
id HUM224A063 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: complement {2 .. 165) 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 164 
id R81598 
est 

(ix) FEATURE: 

(A) ^^AME/KEY: sig_^peptide 
(3) LOCATION: 126.. 170 

(C) IDENTIFICATION METHOD: Von Heijne matri.x 
(0) OTHER INFORMATION: score 7.1 

seq KLKLLSLLRPSLC/IP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 



AACACAAGCA AAACTTTTAA ATATTTGAAT TGACAGTTAC ATGTTTCATA ACTTTGTATG 60 

TCTATTGGTT GTGCAGGTGT AATTTTTTCC CTTTTTGATT AGGGTTACAA AATTTAGAGA 120 

CCAGT ATG ATT .AAG TTG AAG CTC CTT A3C CTC CTT C3A CCT AGT CTC TGC 170 
Met lie Lys Leu Lys Leu Leu Ser Leu Leu Arg Pro Ser Leu Cys 
-15 -10 -5 

.ATA CCT CAA CTT TTA CGT ACC A.AT GCT ACT CTG CTG TTC ACA ATT GCC 218 
lie Pro" Gin Leu Leu Arg Thr Asn Aia Thr Leu Leu ?he Thr lie Ala 
15 10 15 

TCA TGT AAT CTG CAG ATT CCT GCC TCC CCA CGA CGG 254 
Scr Cyo Asn L2u Gin lie Pro Aia Scr Pro Arg Arg 
20 25 



(2) INFORf-iATIOr; FOR SEQ ID NO: 231: 

(i; szQ'szr.zz characteristics: 

(A) LENGTH: :0 base pairs 

[3) TYPE: NUCL^.TC AClb 

i'Z) STRANDEDNE35 : DOUBLE 

O) TOPOLOGY: LINE.AR 

(ii) i-iOLECULE TYPE: CDNA 

fvit c=:g:::al source: 
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_ (A) ORGANISM: Homo Sapiens 
in TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 100.. x44 

IC) IDENTIFICATION METHOD: biascn 
(D) OTHER INFORMATION: identity 97 

region 95. . 139 

id T95183 

est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 56. . 105 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 90 

region 52. . 101 
id T95X33 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: ICO.. 144 

(C} IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 101. .145 

id R48890 

est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 73.. 105 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORt^ATION: identity 90 

region 75 . , 107 
Id R48300 
est 

(ix) FEATURE: 

(AJ NAME/KEY: sig_peptide 
(3) LOCATION: 13.. 77 

(C) IDENTIFICATION MET.HOD: Von Keijne matrix 

(D) OTHER INFORt-lATION: score 6.5 

seq GLCVLQLTTA'/TS/AF 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 231: 



AACCTTCACA GTGTGAG ATG CCT ACT GTG AAC AGT GCT GGA TTA TGT GTC 50 
Met Pro Ser Val Asn Ser Ala Giy Leu Cys val 
-20 -IS -10 

TTG CAG 7TG ACA ACG GCA GTR ACC AGT GCC TTT TTA CTA GCA AAA GTG 9S 
Leu Gin Leu Tr.r Thr Ala Val Tnr Ser Aia Phe Leu Leu Ala Lys Val 
-5 1 5 

AAT CCT TTC GA.A RCT TTT CTC TCA AGG GGC TTT TGG CTA TGT GCT 14 3 

ksn Pro ?rie Glu Xaa Phe L-5-i :;er Ar^ 31y Phe Tr? Leu Cys Ali 
10 13 20 
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(2) IMFOE^ATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDJJESS: DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: coniplement { 118 . . 17 9) 

(C) IDENTIFICATION METHOD: blascn 

(D) OTHER INFORMATION: identity 90 

region 296. . 357 
id T92237 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 86.Tl45 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.4 

seq ALFLLVSXYMIRS/GT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 



ACAGAGAMTA ACGATGTTTC TTATTTGAAT CCAGTGAAAG T.^CTCATGCT TTGTGTTCTT 60 

GGGAATTACT G.nGTTCAAAT TCCTA ATG ATG CTT GGG TTA CAC TTT GCT TTG 112 

Met Met Leu Gly Leu His Phe Ala Lsu 
-20 -15 

TTT CTC CTA GTT TCT KTW TAT ATG ATC CGG AGT GGC ACT GGT AAT AAG 160 
Phe Leu Leu Vai Ser Xaa Tvr Met He Arg Ser Gly Thr Gly Asn Lys 
-10 -5 1 5 

ATT GAA GAA GGT GGG CGG 1*^8 
He Glu Glu Gly Gly Arg 
10 



(2) INFORMATION FOR SEQ ID NO: 233: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 base pairs 
(3) TYPE: NUCLEIC ACID . 
(C) STRANDEDNESS : DOUBLE 
\0) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CONA 
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•vi;- -ORIGINAL SOURCE: 

(A) ORGANISM; Kono Sapiens 
(F) TISSUE TYPE; Testis 



fix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 32. . 178 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 2. . 148 
id H42333 
est 



(i:<) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 35. . 178 

{CI IDENTIFICATION METHOD: biastn 
(D) OTHER INFORMATION: identity 93 

region 5 , . 148 
id R67703 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 145.. 178 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 91 

region 29. .62 
id W90193 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 38.. 76 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.4 

seq MALLLGVLRVLLG/GF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 



ATCGGCG'GGG CCAACCCACG GTGGGGGGAG CGCGGCC ATG GCG CTC CTG CTT TCG 55 

Met Ala Leu Leu Leu Ser 
-10 

GTG CTC CCT GTA CTG CTG GGC GGC TTC TTC GCG CTC GTG GGG TTG GCC 103 
Val Leu Arg Vai Leu Leu Gly Gly Phe Phe Ala Leu Val Gly Leu Ala 
-5 1 5 

AAG CTC TCG GAG GAG ATC TCG GCT CCA GTD TCG GAG CGG ATG AAT GCC 151 
Lys Leu Ser Giu Glu He Ser Ala Pro Val Ser Glu Arg Met Asn Ala 
10 15 20 25 

CTG TTC GTR MAG TTT GCT GAG GTG CTC GGG 181 
Leu Vai Y.aa Phe Ala Gin Val Lftii Gly 

30 35 



:2) rMrORL^L=.TIOM for seq ID MO: 234: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 
(3) TYPE: MOCLEIC ACID 

(C) 3TRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vii ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 100.. 154 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 94 

region 111 . . 165 
id HSC2EB02I 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 100.. 154 

(C) IDENTIFICATION METHOD: blastn 
CD) OTHER INFORMATION: identity 94 

region 37 . . 91 
id T31104 
est 

(ix> FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 34.. 84 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.2 

seq LWLSLVAWHWGEA/VL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 



ACTTTTTTCA CGCTACTCCC CCGGAGTGCT TGG ATG TTG AAG AGT CTC TGG TTG 54 

Met Leu Lys Ser Leu Trp Lea 
-15 

AGO CTT GIG GCC TGG CAC TGG GGT GAG GCT GTC CTC CTC TCC OCT CAT 102 
Ser L2U Val Ala Trp His Trp Cly Clu Ala Vai Leu Leu Ser Pro His 
-10 -5 15 

CTC CCT GCA GCG GCA GAA TGG CCC CGG GCA GCG TGT GAT TCG GGA GGT 150 
Leu Pro Ala Ala Ala Glu Trp Pro Arg Ala Ala Cys Asp Ser Gly Gly 

10 15 20 

GAA CCG 156 
Glu Pro 



[?) TMrORi-IATTON FOR SEQ ID NO: 2 35 : 

(i) SEQUENCE CHARACT£.=IISTICS: 

ik) LENGTH: 254 base pairs 



4 
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... (B) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3 : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECWLE TYPE: CDNA 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{D TISSUE TYPE: Ovary 



(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 75. .152 

(C) IDENTIFICATION METHOD: blastn 

(0) OTHER INFORMATION: identity 96 

region 109. . 186 
id R38459 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 143-.200 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 183. .235 
id R38459 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 183. .227 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.1 

seq IVTWLLXSFMSSA/EE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

AAT.ACTTTGG CAGCTTCTTC ACGTCGGTCC TCTCCGCGCG CGGGTAGGAA CCGTCCACGG 60 

CCTTAAAGAA GCCTCCTCAC CAGCCATACT TCCCATTGCC TCCAGCTGTT GCACGGAGGT 120 

TTCACATCAT ATTTCCACAA CGCTCCTCCA AAC.AGTCAAT ATGTGTCGCA TCCAGAGAGC 180 

TG ATG GGG ATT GTG ACT TGG CTG CTG TMA TCC TTC ATG TCA AGC GCA 227 
Met Gly He Val Thr Trp Leu Leu Xaa Ser Phe Met Ser Ser Ala 
-15 -10 -5 

GAA GAA TCT GTG TCA GCC CGC ACA CGG 254 
Giu Glu Ser Val Ser Ala Arg Thr Arg 
1 5 



(2) TNFORMATTON FOR ^r,Q TD NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STPANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 
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(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 83.. 175 

(C) IDENTIFICATION METHOD: blastn 

- (D) OTHER INFORMATION: identity 98 

region 80. .172 
id T62095 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
CB) LOCATION: 37.. 82 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 93 

region 35.. 80 
id T62095 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 2.. 36 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1..35 
id T62095 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 71.. 187 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 99 

region 85. .201 
id N43024 
est 



(ix) FEATURE: 

(A) NAME/KEY; other 

(B) LOCATION: 4.. 71 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 92 

region 17 . . 84 
id N43024 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 37.. 187 

(C) IDEMTIFICATIOM METHOD: blastn 

(D) OTHER INFORMATION: identity 93 

region 26. . 176 
id W42796 
est 
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(ixy- FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 86.. 187 

(C) IDENTIFICATION METHOD: blastn 

(D) OTilER INFORMATION: identity 92 

region 114 . .215 
id AA030227 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 86.. 187 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 92 

region 51 . . 152 
id AA118270 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 80.. 163 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6 

seq IGLMFLMLGCALP/IY 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 



GTAGCGCGTC TTGGGTCTCC CGGCTGCCGC TGCTGCCGCC GCCGCCTCGG GTCGTGGAGC 60 

CAGGAGCGAC GTCACCGCC ATG GCA GGC ATC PJKA GCT TTG ATT AGT TTG TCC 112 

Met Ala Gly He Lvs Ala Leu He Ser Leu Ser 
-25 " -20 

TTT GGA GGA GCA ATC GGA CTG ATG TTT TTG .^^TG CTT GGA TGT GCC CTT 160 
Phe Gly Gly Aia He Gly Leu Met Phe Leu Met Leu Gly Cys Ala Leu 
-15 -10 -5 

CCA ATA TAG AAC AA.a TAC TGG CCC CCC GGG 190 
Pro lia Tyr Asn Lys Tyr Trp Pro Pro Gly 
1 5 



C2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARJ^CTERISTICS : 

(A) LENGTH: 222 base pairs 

(9) TYPE: NUCLEIC ACID 

(C) STR.ANDEDNESS : DOUBLE 

(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

:vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
iFl TISSUE TYPE: Ovary 

(ix) FE.ATURE: 

:A) NAME/KEY: ether 

[z) LCCATIO::: conipiemer.t (13.^. . IfiS) 
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(C; IDENTIFICATION METHOD: blas'in 
(D; OTHER INFORMATION: identity 9?. 

region 270. .299 
id W73179 
est 

{i:<) FEATURE: 

(A) MAME/KEY: other 

(3) LOCATION: ccmpienent (1 39 . . 168 ) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 93 

region 295. .324 
id R59325 
est 



(ix> FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: compiement ( 1 i 3 . . 1 4 4 ) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORf^TION: identity 93 

region 218 . .249 
Id RC6383 
est 



(i:<) FEATURE: ^i. 

(A) NAME/KEY: sig^peptide J* 

(B) LOCATION: 121. .198 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION; score 5.3 

seq VKLVTLSVPTSLA/SS 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 237: 



ATTTGTTCCA AAGGTTCCAA TTATTCAAGA CTGCCTTTGG CTTCTTTTAC A.-.CA7GGATG 60 

ATTCTATGTT ATGGGCACTG AAACTAAAAG AAACTGTGGA AGGATTCiGTA CCTTAGAGAA 120 

A7G AAA AAG CAA AAA CAT CAG AAA TTA TGG TCT ATT TCT GTA r-A3 TTA 168 
Met Lys lys Gin Lys His Gin Lys Leu Trp Cys lie Ser Val Lys Leu 
-25 * -20 -15 

GTG ACA CTCi AGT GTG CCC ACC TCT CTT GCC TCC TCT TTA ACC TCC CCT 216 
Vai Thr Leu Ser Vai Pro Thr Ser Leu Ala Ser Ser Leu Thr Ser Pro 
-10 -5 L 5 

ACA GGG 222 
Thr Giy 



{2) IliFORMATION FOR 3£Q ID NO: 233: 

(1/ SEQUENCE CHARACTERISTICS: 

(A; LEN'GTH: 4 17 base pairs 
(3) TYPE: NUCLEIC AClb 
iC) STRANDEDNES3: D0U3LE 
;0) TOPOLOGY: LINEAR 

:ii) MOLECULE TYPE: CDNA 
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(vi>- ORIGINAL SOURCE: 

(A; ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) FSATUEttl: 

(A) NAME/KEY: other 

(3) LOCATION: complement ( 227 .. 4 14 ) 

(C) IDENTIFICATION METHOD: biascn 

(D) OTHER INFORMATION: identity 99 

region 94..28i 
id H53025 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 131.. 264 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 190. . 323 
id H52956 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 285.. 318 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 94 

region 347. ,380 
id HS2956 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 131.. 233 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 191 . .293 
id H53024 



i^iK) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 227.. 272 

(C) IDENTIFICATION METHOD: blastn 
CD) OTHER INFORMATION: Lcentity 93 

region 233 . .333 
ic H53024 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: 1047.303 

;C) IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 5.3 

seq VLFALFVAFLLRC/KL 

(:<i) SEQUENCE DESCRIPTION: SSQ ID NO: 233: 



AAGTATCTCA CC.^TTTCTTT CTCTTTCTGA .ACCACATT3G GTGCCAACAG .^^^.tTTGCTCT 50 
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CTGTTCTCTT-TCAAAATTAC CAACATGGAC CCCACCCAAT TCTTCCCT7G GPACTAAGGA 120 

ACGCCTGACT GATCATCTGA TACAGCAGTK CCTGAGCAGA ACAA---^\C;v^iC AAJLAACAGGA 180 

GAG ATG GAT GGA ATA CCC ATG TCA ATG AAG AAT GAA ATG CCC ATC TCC 223 
Met Asp Gly He Pro Met Ser Met Lys Asn Glu Met Pro He Ser 
-40 -35 -30 

CAA CIA CTG ATG ATC ATC GCC CCC TCC TTG GGA TTT GTG CTC TTC GCA 276 
Gin Leu Leu Met lie He Aid Pro Ser Leu Giy Ph.e Val Leu Fhe Ala 
-25 -20 -15 -10 

TTG TTT GTG GCG TTT CTC CTG AGA GGG AAA CTC ATG GAA ACC TAT TGT 324 
Lea Phe Vai Ala Phe Leu Leu Arg Gly Lys Leu Met Ciiu Thr Tyr Cys 
-5 ^1 5 

TCG CAG AAA CAC ACA AGG C7A GAC TAC ATT GGA GAT AGT AAA AAT GTC 372 
Ser Gin Lvs His Thr Arg Leu Asp Tyr lie Giy Asp Ser Lys Asn Vai 
io 15 20 

CTC AAT GAC GTG CAG CAT GGA AGG GAA GAC GAA GAC GGC CAT GGG 417 
Le-j Asn Asp Vai Gin His Giy Arg Giu Asp Giu As? Gly His Giy 
25 30 35 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDSDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORiGINAL SOURCE: 

<A! ORGANISM: Homo Sapiens 
{?) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAiME/KEY: other 

(B) LOCATION: 246.. 293 

{C) IDENTIFICATION METHOD: ciastn 
(D) OTHER INFORMATION: ider.titiy 95 

region 30 - .137 

id HJ3824 

est 

(ixj FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 246.. 293 

(C; XDEMTIFICATION .METHCD: blastn 

(C) OTHER INFORMATION: identity 95 

region 93 . . 1 D 
i:i =73173 
est 

(ix) FEATURE: 

;A) N.AME/KEY: other 
:B. LOCATIOM: 246.. .223 



wo 99/06549 PCT/IB98/0t231 

196 

(Ci IDSNTIFICATIDN METHOD: blastn 
iO) OTHER INFORMATION: identity 93 

region 112 . .159 

id H26792 

est 

(ix) FEATURE: 

(A) MAilE/KEY: sig peptide 
(3) LOCATION: 2I.7l91 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(01 OTHER INFORMATION: score 5.7 

seq LAICSCLPGPGPA/LP 

{XI) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 



ACCTCGCTGC TCTTCATCCC ATG GGT GGA 7TT TTG CAT CTC CCT GCT CTG TCT 53 

Met Gly Gly Phe Leu His Leu Pro Ala Leu Ser 
-55 -50 



TCC TCC TGT CTT TGG ACA TTT CCA CCG ATG TGT GTT CGC ATC TTC TCC 101 

Ser Ser Cys Leu Trp Thr Phe Pro Pro Met Cys Val Arg lie Phe Ser 
-45 -40 -35 

TAT GTT CCT T?A CCT ATC CTG ACC CCC AAA ACC ATA AAT CTC ATC CCC 14 9 

Tyr Val Pro Leu Pro lie Leu Thr Pro Lys Thr lie Asn Leu lie Pro 

-30 -25 -20 -15 

GTT CTG GCC ATC TGT TCC TGT CTT CCT GGC CCC GGG CCG GCC CTT CCT 197 

Vai Leu Aia lie Cys Ser Cys Leu Pro Gly Pro Gly Pro Ala Leu Pro 
-10 . -5 1 

CTT CCT GCC TTC CCG ACC CTC CTT GTG TCT TGG TAC CAC TGC CCC CCA 24 5 

Leu Pro Ala Phe Pro Thr Leu Leu Val Ser Trp Tyr His Cya Pro Pro 
5 10 15 

CAG AA.G ?JxG ACA GGC ATG ATG GAC ACG GAT GAT TTC CGC GCC TGC CCG 2 93 

Gin Lys Lys Tnr Gly Met Met Asp Thr Asp Asp Phe Arg Ala Cys Pro 
20 ' 25 30 



i2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A; LCNGTtl: 416 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D; TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{?] TISSUE TYPE: Testis 

(ix) FEATURE: 

.A: MAKE /KEY: Other 

LOCATION: 259.. 413 
1 IDENTIFICATION METHOD: blastn 
: O'DiER ISFOP>!ATIOM: identity 99 
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region 1 65 , .319 
id N46466 
est 



iix) FEATURE: 

(A) MAME/KEY: other 
(B? LOCATION: 55. ,147 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORM^VTION: identicy 9S 

region 52 . .144 
id N46466 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 252. .338 

(C; IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identity 95 

region 38 . . 124 
id W86523. 
est 

(ix) FEATURE: 

{A; NAME/KEY: other 

(B) LOCATION: 336.. 4 13 

(C) IDENTIFICATION METHOD :. blastn 
(0) OTHER INFORMATION: identity 97 

region 123. .200 
id W86523 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: complement (322 413) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 93 

region 214 . . 305 
id W36648 
est 



(ix) FEATURE: 

(A) NAME/KEY: ether 
(3) LOCATION: complement (276 326) 
(CI IDENTIFICATION METHOD: blastn 
(D) OTHER INFORMATION: identity ?3 

cecion 302 . . 352 

id'w86o48 

est 



(iy.) FEATURE: 

(A) NPuME/KEY: sig^peptide 
i'd) LOCATION: 2 64 . .320 

:*:) IDENTIFICATION METHOD: Vor^ Keijr.e T:Jitri7. 
iiZ) OTHER IN FORMAT ICti: score 3 

ccq WGrLCvLr?.V.*H?/.«V? 

(xi) SEQUENCE DESCRIPTIOM: 3EQ ID NO: 24:.: 



GTCACACAGC TG3GATGT3G CPiGkGSTGO-Z CTTCCAGCTC CTC-VTCCCAT T3CTGG.-.C.-.:S 
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CTGCCACATC'TGGCACCCAA TTTAGGACCC CGCGGGGAGG CCCAAGCCCC GGGGGTGGCG 12C 



GGGGATCCTA GAGGAAAGTG GCAAGGCCAG GACCCTGGAG CAGAGCCAGA G7AGAAAAC7 180 

GAGGCTCTGA GAGATGAAGC TACTTGCCAA GGTCACGCAG CACAGTCACA TCCTACTGAA 240 

CATCATCCTG TTCTCTGGGT GGA ATG TCA CCA TCG CCC AGG TGG GGA TTT TTG 293 

Met 3er Pro Ser Pro Arg Trp Gly Phe Leu 
-15 -10 



TGT GTB TTG T7C ACT GCT GTA CAC CCA GCC CCC AGC ACA GCG CCT GTC 
Cys Val Lau Phe Thr Ala Val His Pro Ala Pro Ser Thr Aia Pro Vai 
-5 1 5 



341 



GAG GAC AAG TGC CCA GTA AAC ACT TGG GAA GCA ATG CAA GCG TCC TCC 
Gin Aso Lys Cvs Pro Val Asn Thr Trp Glu Aia Met Gin Ala Ser Ser 
10 " 15 20 



389 



CAG CAG CTC CTG CAA ACA GAC CCC ATG 
Gin Gin Leu Lei: Gin Thr Asp Pro Met 
25 30 



416 



(2) INFORMATION FOR SEQ ID NO: 241: 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 432 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 



(11) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 60.. 38 6 

(C; IDENTIFICATION METHOD: blast n 

(D) OTHER INFORMATION: identity 93 

regica 7 . .333 
id AA035203 
est 



(ix) FEATURE: 

(A} NAME/KEY: Other 
(3) LOCATION: 400.. 429 

(C) IDENTIFICATION METHOD: blast.-^ 

(D) OTHER INFORI^ATION: identity ICO 

region 349.. 373 
id AA035208 



(ix) FE-BlTURE: 

(A) NAME/KEY; ether 
:3) LOCATION: 77.. 429 

(C: IDENTiriCATION METHOD: hl=i3z:. 
{C; OTHE-^ Isror^aMATlOM: idertir./ -K* 
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region 1 . . 353 
ici'H64963 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 50.. 328 

(C) . IDENTIFICATION METHOD: blascn 

(D) OTHER INFORMATION: identicy 98 

region 10. .288 
id R97144 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 56.. 393 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 3 . .340 
id N73170 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 14.. 300 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identi-y 98 

region 2 . . 288 
id H13072 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig^peptlde 
(3) LOCATION: 154.. 381 

(C) IDENTIFICATION MITHOD: Von Hei]ne matrix 

(D) OTHER INFORMATION: score 5.3 

sec lILASASrSPNTT/QV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 i: 



AGTAAAAAAA CACTGGAATA AGGAAGGGCT GATGA^TTTC .^GA.nGATGAA GGTP-^3TAGA 60 

AACCGTTGAT GGGACTGAGA AACCAGAGTK A=u5Ji.CCTCTT TGGAGCTTCT GAGGACTCAG 120 

CTCG-^iACCA.-. CGGGCACAGT TGGCAACACC ATC ATG AC A TCA CAA CCT GTT CCC 174 

Met The Ser Gin Pro Val Pro 

-73 -70 

AAT GAG ACC ATC ATA GTG CTC CCA TCA PAT GTC ATC AP.C TTC TCC C.-A 222 
Asn Glu Thr lie lie Val Leu Pro Ssr Asn Vai lie Asn ?ae Ser Gl.- 
-65 -60 -55 

GCA GAG AAa CCC G^^A CCC ACC AAC CAG GGG CAG GAT ACC CTG AAG .^.-A 270 
Ala Glu L-/3 Vzo Glu Cro Thr .^sn Gin Gly C:.'*. Asp Ser Leu Lys Lys 
-50 -45 * -40 

CAT CTA C;.C GCA GAR RTC AAA GTT ATT GGG ACT ATC CAG ATC TTG TGT 313 
Ki-: Leu His Ala Glu Xaa Lys Vai lie Giv T^r lie Gin lie Leu Cys 
-35 -33 -25 
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GGC ATG ATG GTA TTG AGC TTG GGG ATC 
Giy Met: Met Vai Leu Ser Leu 31/ lie 
-20 -15 

TCT CCA AAT TTT ACC CAA G7G ACT TCT 
Ser Pro Asn Phe Thr Gin Val Thr Ser 
-5 1 

CCA TTC ATA GGA CCC GGG 
Pro Phe lie Giy Pro Gly 
15 



ATT TTG GCA TCT GCT TCC TTC 366 
lie Leu Ala Ser Aia Ser ?ne 

-10 

ACA CTG TTG AAC TCT GCT TAC 414 
Thr Leu Leu Asn Ser Ala Tyr 
5 10 

432 



(2) INFORMATION FOR SSQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 37 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) FEATURE: 

CA) NAME/KEY: other 

(B) LOCATION: 47.. 230 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity IOC 

recicn 158 . . 341 
id A-AC40ai3 
est 



(i:<) FEATURE: 

:A) NAME/KEY: other 
iEi) LOCATION: 229.. 395 

(C) IDENTIFICATION ^]£THOD: clascn 

(D) OTHER INFORMATION: icentitv 99 

region 3^1.. 507 
ic i.ij4 0813 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 205. .429 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 93 




est 



(IV) FEATURE: 

(A) NAt-lE/KEY: other 
(3; LOCATION: complement ( 32 5 4 2 2 ; 
;':') IDENTIFICATION METHOD: claJtn 
iO) OTHER INFO?J<ATION: ider.titv 56 
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id 
es" 



(ix) FEATURE: 

:A) NAME/KEY: other 
:B) LOCATIOM: compiemenr (215. .269) 
;C) IDENTIFICATION METHOD: olastn 
:D) other INFORMATION: identity 98 

region 381 435 

id'AAC40I49 

est 



:ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: complement (279 .. 327 ) 
;C) IDENTIFICATION METHOD: blastn 
(D) OTHER INFORMATION: identity 100 

region 321. . 369 
id .=AC40149 
est 



:iv) FEATURE: 

(A) NAME/KEY: sig_?eptide 

(B) LOCATION: 57.. 329 

iC) IDENTIFICATION METHOD: Von Heijr.e matrix 
(D) OTHER INFORMATION: score 4.8 

sec IILRLPWLNRSQT/VV 

[Ki] SEQUENCE DESCRIPTION: SEQ ID NO: 242: 



ACGCTTCGTC CTCTGCAGTC AAGACGCTGG GCGCGTCGAG GACTGGGATT TCAA-AT ATG 59 

Met 



CGT GCA TTA GAG AAT GAT TTT TTC AAT TCT CCC CCA AGA Pi?JK ACT GTT 
Arg Ala Lea Glu Asn Asp Phe Phe Asn Ser Pro Pro Arg Lys Thr Val 
-90 -85 -30 -75 



-5 



GCr "7 CT7 -^GT AAT CTT GTA TCA GCA GAG -.CT GTT TTC CTC CCG 
Ala ?r.^ -^.j Glv Asn Leu Vdl Ger Ala Glu Tr.r Vai ?r.2 Leu F;:z Pro 

15 20 



L07 



CGG TTT GGT GGA ACT GTG ACA G.AA GTC TTG CTG A^G TAC AAA .A3.G GGT 155 
Arg Phe Gly Glv Thr Val Thr Glu Val Leu Leu Lys Tyr Lys Lys Gly 
-70 -65 -60 

GAA ACA AAT GAC TTT GAG TTG TTG AAG AAC GAG CTG TTA GAT CCA GAC 203 
Giu Tr.r Asn Aso Phe Glu Leu Leu Lys Asn Gin Leu Leu Asp Pro Asp 
-35 -50 -45 

ATA AJ^G GAT GAC CAG ATC ATC A-AC TGG CTG CTA O^J'. TTC CGT TZT TCT 251 
lie Lys Aso Asd Gin lie lie Asn Trp Leu Leu Glu Phe Arg Ser Ser 
-40 -35 -30 

GTC ATG TAC TTG ACA AAA GAC TTT GAG CA^v CTT ATC AGT ATT ATA TTG 299 
val M=- Tyr Leu Thr Lys Asp Phe Glu Gin L^u He Ser lie 11^ L*m 
-25 -20 -15 

AGA TT3 CCT TGG TTG AAT AGA AGT CAA ACA GTA GTG C^^-A GAG TAT TTG 347 
Arg Leu Pr:; Tr? Leu Asn Arg Ser Gin Thr Val Val Glu Glu .T /r Leu 



305 
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TGT CTC AGC ATG ATT GCT TCC CAT TTT GWG CCT CCC GAG CTG 4 3"? 

Cvs Leu Ser Met lie Ala Ser His Phe Xaa Pro Pro Glu Leu 
25 30 35 



(2) INFORMATION FOR SEQ ID MO: 243: 



(i: SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 4 base pairs 
(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 



(ijL) MOLECJLE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 54.. 242 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 i' 

region 12. .200 
id R19497 
est 



(ix) FE.ATORE: 

(A) NAME/KEY: other 

(3) LOCATION: 78. .242 

(C) IDENTIFICATION METHOD: blastn 

(0) OTHER INFORMATION: identity 98 

region 1 . .165 
id H75597 
est 



(ix) FE.ATURE: 

(A) NAME/KEY: Other 
(3) LOCATION: 84.. 242 

(C) IDENTIFICATION METHOD: blasLri 

(D) OTHER INFORM.ATION: identity 93 

region 1 , . 159 
id H93393 
esz 



(ix) FE.ATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 122.. 243 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 122 
id HUM030E11B 
est 

cZATURE: 
(A; name /KEY: sig_pepcide 
(=: LOCATION: 74 !Tl66 

[Z: IDEMTIFItATION METKO!;: Von Heijn- r.^irly. 



wo 99/06549 PCT/IB98AI123] 

203 

(D) OTHER INFO!iMA?ION: score 4.8 

seq WAFSCGTWL?SE<A/EW 



(xi) SEQUEMCi DESCRIPTION: SEQ ID NO: 24 3: 



ATAGAAGGGG GTGGGGCCAC GTTTGCGTCC GCGCCATCAG GCCCGAGATA GCGGCGAGGT 60 

CCGCTTTCAG TGT ATG GTT TTC CCT GCC AAA CGG TTC TGC TTG G7G CCA 109 
Met Val Phe Pro Ala Lys Arg Phe Cys leu Val Pro 
-30 -25 -20 

TCC ATG GAG GGC GTG CGC TGG GCC TTT TCC TGC GGC ACT TGG CTG CCG 15"? 
Ser Met Glu Giy Val Arg Trp Ala Phe Ser Cvs Gly Thr Trp Leu Pro 
-15 -10 -5 

AGC CGA GCC GAA TGG CTG CTG OCA GTG CGA TCG ATT CAG CCC GAG GAG 205 
Ser Arg Ala Glu Tro Leu Leu Ala Val Arg Ser lie Gin Pro Glu Glu 
1 ' S 10 



AAG GAG CGC ATT GGC CAG TTC GTC TTT GCC CGG GAC GGG 24 4 

Lys Glu Arg lie Gly Gin Phe Val Phe Ala Arg Asp Gly 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 3 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEONESS: DOUBLE 
CD) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



J[ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 101. .273 

to IDENTIFICATION METHOD: blastn 
(D) OTHER INFORMATION: identity 92 

region 159. .331 

id W57194 

est 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(9) LOCATION: 95.. 340 

(C) IDENTIFICATION METHOD: Von H.eijne rriatrix 
(0) OTHER INFORMATION: score 4.7 

sec LTCLADLFHSIAT/XK 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 



PuAGAGGAGAA GGCTGGGTAT G.^GGAAACTG CTGTCTTTTC ^.CTZZ.-.ZC^^G ACGT.^AATAT 
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TAACTGGACC - TCTCTTTAGA TTCTTTGCTC AATA A7G AAT TGT TT7 CAG GGC ACC iL5 

Met Asa Cys ?he Gin Gly Thr 
-30 

AAT GCC TCT GCT ITTG GAA AAA GAC ATT GGT CCA GAG CAG TTT CCA ATC 163 
Asn Ala Ser Ala Leu Giu Lys Asp lie Gly Pro Giu Gin Phe Fro He 
-75 -70 -65 -60 

AAT GAA CAC TAT TTC GGA TTG GTC AAT TTT GGA AAC ACA TGC TAC TGT 211 
Asn Giu His Tyr Ptie Giy Leu Val Asn Phe Gly Asn Thr Cys Tyr Cys 
-55 -50 -45 

AAC TCC GTG CTT CAG GCA TTG TAC TCC TGC CGT CCA TTC CGG GAG AAT 259 
Asn Ser Vai Leu Gin Ala Leu Tyr Ser Cys Arg Pro Phe Arg Giu Asn 
-40 -35 -30 

GTG TTG GCA TAC AAG GCC CAG CAA AAG AAG AAG GAA A.AC TTG CTG ACG 307 
Val Leu Ala Tyr Lys Ala Gin Gin Lys Lys Lys Giu Asn Leu Leu Thr 
-25 ' -20 -15 

TGC CTG GCG GAC CTT TTC CAC AGC ATT GCC ACA SAG P.AG AAG AAG GTT 555 
Cys Leu Ala Asp Leu Phe His Ser lie Ala Thr Xaa Lys Lys Lys Val 
-10 -5 .1 5 

GBR TCA TCC CAC CTG GGG 373 
Xaa Ser Ser His Leu Gly 
10 



(2) INFORMATION FOR S£Q ID NO: 24 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 
(U) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 73.. 182 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: iaentity 93 

regica 68. .177 
W63868 

eat 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 62. . 182 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORt-LCwTIOM: identity 96 

region 1 . .121 
id C17761 



est 
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(ix) FEATURE: 

(A) KAME/KEY: other 
•:B) LOCATION: complemen" ( 1 50 . . 182 ) 
•;C) IDENTIFICATION METHOD: blastn 
{D) OTHER INFORMATION: identity 96 

region 185. .217 
id M60944 
esz 



(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 20.-67 

(C) IDENTIFICATION METHOD: Vcn Heijna matrix 

(D) OTHER INFORMATION: score 4.6 

seq ALRVRXXXFGTRA/CR 

(xij SEOCJENCE DESCRIPTION: SEQ ID NO: 245: 



iJVTTTCCGAS CCGGGCAAG ATG GCA GCG GCG CTG CGC GTG CGT KGT TSA STG 52 

Met Ala Ala Ala Leu Arg Val Arg Xaa Xaa Xaa 
-15 -10 

TTC GGG ACG CGG GCC TGC AGG CGC CAT GGT CTT CCT CAC CGC GCA STC 100 
?he Giy Thr Arg Ala Cys Arg Arg His Gly Leu Pro His Arg Ala Xaa 
-5 1 5 10 

TGG CTG CGG IKXT CGC GTC ASC GAC CGC TAG TTT CGG ATC CAG GAG GTG 14 8 
Trp Leu Arg Asn Arg Val Xaa Asp Arg Tyr ?he Arg lie Gin Glu Val 
15 20 25 

CTG AAG CAS GCC AGG CAC TTC CGG GGA AGG AAA AGG 134 
Leu Lys Xaa Ala Arg His Phe Arg Uly Arg Lys Arc 
30 35 ' 



;2J INFORMATION FOR SEQ ID NO: 246: 



a) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 base pairs 

(B) TYPE: NUCLEIC AClb 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) NAME/KEY: Other 
(3) LOCATION: 35.. 186 

(C) IDENTIFICATION METHOC : blastn 

(D) OTHER INFORMATION: identity 

legion 1 . . 1 S2 
Id .i^A0533:a 
est 
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{ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 32. .135 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 100 

region 1 , . 104 
id T50012 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 134.. 186 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 102 . . 154 
id T50012 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 56. . 186 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 95 

region 33 . .165 
id H79942 
est 

(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 21. .135 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 93 

region 3 . . 117 
id AA0586O5 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 134 186 

(C) ■ IDENTIFICATION METHOD: blast- 

(0) OTHER INFORMCiTION: identity 93 

region 1 IS . .167 
id AA05Q605 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 43. .186 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 99 

region 1 . . 139 
id R37526 
est 



(i.x) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: So.TlOO 

(C) IDENTIFICATION METHOD: Von He'^:'^ -^rTrix 
iO) OTHER INFORriSiTIOM: score 4.6 

seq LLTH^;^:-S5HV?C/VG 
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(xir SEQUENCE DESCRIPTION: SEQ ID NO: 246: 



CTAATCGAAA AGGGGGATTT TCCGGTTCCG GCCTGGCGAG AGTTTGTGCG GCGAC ATG 56 

Met 
- * 5 

AAA CTG CTT ACC CAC AAT CTG CTG AGC TCG CAT GTG CGG GGG GTG GGG 106 
Lys Leu Leu Thr His Asn Leu Leu Ser Ser His Val Arq Giy Val Gly 
-10 -5 1 

TCC CGT GGC TTC CCC CTG CGC CTC CAG GCC ACC GAG GTC CGI ATC TGC 154 
Ser Arg Gly Phe Pro Leu Arg Leu Gin Ala Thr Glu Val Arg lie Cys 
5 10 15 

OCT GTG GAA TTC AAC CCC AAC TTC GTG GCG CGA CGG 190 
Pro Val Glu Phe Asn Pro Asn Phe Val Ala Arg Arg 
20 25 30 



(2) INrORMATION FOR SEQ ID MO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 189 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STKAMDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: Other 
(S) LOCATION: 54.. 186 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 94 

region 45 . . L77 
id HSC2KH09i 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 9.. 52 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFOW-IATION; identity 9H 

region 1 . . 44 
id H3C2KH091 
est 

(ix) FEATURE:' 

{A: NAME/KEY: other 
(3) LOCATION: 82.. 117 
iZ) IDENTIFICATION METHOD: blastn 
(D) OTHER INFOPJ^ATION: identity 97 

region 1 . . 35 
id A.A09C7C4 
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iix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 129.. 166 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 93 

region 36. . 93 
id AA126596 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 93.. 131 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 94 

region 1. . 39 
id AA126596 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 122.. 181 

iC) IDENTIFICATION METHOD: blastn 
fD) OTHER INFORMATION: identity 91 

region 4 0. . 99 ^i. 

id AA090640 ** 

est 



(ixl FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 88.. 117 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 8 . . 37 
id AA090640 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 99.. 18 6 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 95 

region 1. .88 
id T36119 
est 



(ix) FEATURE: 

(A) NAME /KEY: sig_,peptide 
(3) LOCATION: 7 . . 129 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6 

seq VSAGSLLLrAPOA/EX 

(xii SEQUENCE DESCRIPTION: SEQ ID NO: 247: 



P^ACGGC ATG GGA TWC TTC TCA CGG CGC ACG TTC TGT GGG CGG AGT G'Zr, 43 
Mot Cly Xaa Phe Scr Arg Arg Thr Phe Cys Gly Arg Ser Gl> 
-40 -35 -30 

CGG AGC TGC CG3 GGT CAG TTG GTC CAA GTG TCC CGG CCT GAG GTG T?.:: 96 
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Arc Ser Cy^s Arg Gly Gin Leu Val Gin Val Ser Arg Pro GIu Val Sar 

-25 -20 -15 

GCC GGA TCC CTC CTT CTC CCG GCG CCT CAA GCG GPJ^ GAS CAT ?CC TCA 14 4 

Aia Giy Ser Leu Leu Leu Pro Ala Pro Gin Ala Glu Xaa His Ser Ser 

-10 -5 15 

WGR RTT TTG TAT CCA AGG CCC AAA AGT TTG TTA CCC A.^G ATG GGG 139 

Xaa Xaa Leu Tvr Pro Arg Pro Lys Ser Leu Leu Pro Lys Met Gly 

10 15 20 



(2) INFORMATION FOR SSQ ID NO: 24 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MCLECrJLE TYPE: CDMA 

(vi) ORIGINAL SOURCE: 

•:A) ORGANISM: Homo Sapiens 
[?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

fA) NAME/KEY: other 
(3) LOCATION: 132.. 235 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 99 

region 94. .197 
id R36207 
est 

(i:<; FEATURE: 

(A) NAME /KEY: otner 
(5) LOCATION: 37.. 110 
iC) IDENTIFICATION METHOD: biastn 
•;D) OT.HER INFOR-IATICN: identify 100 

region I . . 74 
id R36207 
est 

(ixi FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 41.. 194 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORilATION: . idencicy 94 

regior. 1 . .154 
id .AAC507 56 
esz 

(i;:; rZMURE: 

;A) NAME/KEY: other 

:3) LOCATION: 107.. 235 

:C) IDENTIcICATIOJI METHOD: biastr^ 

:D) other IMFO?:'L^-TICN: iaentiiy ?6 

regior IC. .143 
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(ix) FEATUEIE: 

•:A) NAME/KEY; other 

(B) LOCATION: 141. .193 

;C) IDEMTIFICATIOM METHOD: biastn 
:D) other INFORMATION: identity 96 

region 33 . . 85 
id AAC91520 
est 

(ix) FEATURE: 

{A) NAME/KEY: other 
(3) LOCATION: 190.. 237 

(C) IDENTIFICATION METHOD: biastn 

(D) GTHEH INFORM/i.TICN: identity 9 7 

region 81.. 123 
id AA091520 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(3) LOCATION: 109.. 142 

(C) IDENTIFICATION METHOD: biastn 

(D> OTHER INFORMATION: identity 100 

region 2 . . 35 
id AA091520 
est 



(XX) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 1..165 

(C) IDEtJTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6 

seq CALSLFD.=i?GASG/GR 

(xi) SEQUENCE DESCRIPTION: SEQ ZD HO: 248: 



ATG GAA GGA GGC GTT CGT CTA G.-.7 TIG 

Met Glu Gly Gly Val Arg Leu Asp Leu 
-55 " -50 

GGA GTC GCT GTC TCT GAA CTT CCA GCC 

Gly Val Ala Val Ser Glu Leu Pro Ala 
-35 

CCC GAG GGC CAT GGG CCG GGT CTC AGG 

Pro Glu Gly His Gly Pro Gly T.ftu Arg 

-20 -15 

GAC GCT CCT GGC GCA TCT GGT GGT CGT 

Asp Ala Pro Gly Ala Ser Gly Giy Arg 
-5 ' 1 

GGA CAG CAA CAT ACA GGC CTG CC7 GCC 

Gly Gin Gin His Thr Gly Leu Pro Aia 
10 15 



TC3 CGT TGC GGG GAG ACT TCA 48 
Ser Ala Cys Gly Glu Thr Ser 
-15 -40 

TC". G.-.G ACC GCC GCC CTT GTC 96 
Ser Jlu T.Tr Ala Ala Leu Val 
-30 -25 

GCT TGI GCC CTC TCG CTT CCT 144 
Al?. Cv? AU Leii Sftr Pro 
-10 

CAT CAC CTT ATT CTG GTC CCG 192 
His .-:is Leu He Leu Val Pro 



TCT Zr::: C:T ZJKC CCC CAG 237 
Ser V?..l His Pro Gin 
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{2) INFORMATION FOR SEQ ID NO: 249: 

(i) SF.QUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs 

(3) T*^?E: NUCLEIC ACID 

(C) STRANDEDNISS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 14 4.. 213 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 97 

region 1 . . 70 
id N538I6 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 21.. 63 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1. .43 
id T34269 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: 1637.204 

(C) IDENTIFICATION METHOD: Von Heij.-.e matrix 

(D) OTHER INFORMATION: score 4.5 

seq TLL5rAAlT.=uAFS/VL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 



.^LAGCGCCGGA HGCGGTGAGG CAC.AGATGAG TAACG7G.=iAT TTG7CCGTCT CCGACTTCTG^ 60 

GAGGTAAGGC GGTCGTCAGC CT.ATCTCTTC TGCTGGCTGG GCTC.=iATGCC GCGGGTGAGC 120 

GTTCGGCCGA GGCTGCTCCT ACCCTTGAGT GATGTGCC7T GA ATG ACG CTG CTT 174 

Met Thr Leu Leu 

TCA TTC GCT GCT CTC ACG GCT GCT TTC ICC G7C CTC CZC AAG 216 
Ser Phe Ala Ala Leu Thr .-.la Ala Phe Ser Val Leu Pro Lys 
-10 -5 1 



(2) IMFORtlATION FOR SEQ ID ?I0: 250: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 base pairs 
(3) TYPE: NUCLEIC ACID 
(C; 3TPAMD£D>:E33: DOUBLE 
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"(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 



(ix) FEATURE: 

(A) NAME/KEY: other 
(B} LOCATION: 46. .271 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 36 . .261 
id H3C3IF011 
est 



(ix) FEATURE: 

(A) NAMS/KEY: other 
(8) LOCATION: 11.. 44 

(C) IDENTIFIC.^VTION METHOD: blastn 

(D) OTHER INFORMATION: identity iCO 

region 2 . . 35 
id HSC3IF011 

est i"-- 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 50.. 271 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 39. .260 
id N28442 
est 



(i:0 FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 18.. 234 

(C) IDENTIFICATION METHOD: blastn 

(U) OTHER INFORMATION: identity 93 

r-gicn 1. .217 
id H'JM517CC1B 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 125.. 215 

(C) IDENTIFICATION METHOD: biascr: 

(D) OTHER INFORMATION: iJ-iiiL-ty 98 

region 1 . . 91 
id 17^607 



(ix) FEATURE: 

(A) NAME/KEY: other 
(S) LOCATION: 217.. 271 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 9^ 

region 92 . .146 

id'T77o07 

est 
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(ix)- -FEATURE: 

(A) NAME/KEY: sig_?eptide 
(3) LOCATION: 3 6.. 98 

(C) IDENTIFICATION* METHOD: Von Hei3ne natri>: 

(D) OTHER INFORt'tATION: score 4,4 

seq GLSKLQFA?rSSA/LD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

AGTGGT7GCN GGAAGTTGAG CGGCGGCAAG AAATA ATG GC3 CCA GCT ACG GGG 53 

Met Ala Ala Ala ?hr Gly 
-20 

GAT C^T GGA CTC TCT AAA CTG CAG TTT GCC CCT TTT AGT AGT GCC TTG 101 
Aso Pro Gly Leu Ser Lys Leu Gin Phe Ala Pro Phe Ser Ser Ala Leu 
.15 -10 -5 ' 

GAT GTT GGG TTT TGG CAT GAG TTG ACC CAG AAG AAG CTG AAC GAG TAT 14 9 
ASP Val Gly Phe Trp His Glu Leu Thr Gin Lys Lys Leu Asn GIu Tyr 
5 10 15 

-GG CTG GAT GAA GCT CCC AAG GAC ATT Ai^.G GGT TAT TAC TAC AAT GGT 197 
Arg Leu Asp Giu Ala Pro Lys Asp lie Lys Gly Tyr Tyr Tyr Asn Gly 
20 25 30 

GAC TCT GCT GGG MTG CCA GCT CGC TTA ACA TTG GAG TTC AGT GCT TTT 24 5 
Asp Ser Ala Gly Xaa Pro Aia Arg Leu Thr Leu Giu Phe Ser Ala Phe 
35 40 45 



GAC ATG AGT GCT CCC ACC CCA AGC 
AsD Met Ser Aia Pro Thr Pro Ser 
50 55 



269 



(2) IMFORM^.TION FOR SEQ ID KO: 25i: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 5 base pairs 

(31 TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CONA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

{i:0 FEATURE: 

(AJ NAME/KEY: ether 

(3} LOCATION: 39.. 14 3 

iZ) IDENTIFICATION METHOD: biascn 

iO) OTHER INFORiMATION: icii-.-i:y 55 

re'jion 50 . . 154 
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(A) NAME/KEY: other 

(B) LOCATION: 3. .45 

(C) IDENTIFICATION METHOD: biastn 
(D; other INFORr<ATION: idenziity 9C 

region 15. . 57 
id R50695 
est 



(i:<) FEATURE: 

(A} NAME/KEY: other 

(B) LOCATION: 81.. 143 

(C) IDENTIFICATION METHOD: biastn 
(0) OTHER INFORMATION: identity 96 

region 104 . . 166 
id R94786 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 81.. 143 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 96 

region 105. . 167 
id T98442 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 50.. 130 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.4 

seq LSKSLLLVPSXLS/LL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 



AAGCTTCCCC TCCCCCGGCG CCCTCTGGGG CTCCGAGCCC GGCGGGkCC ATG TTC ACC 58 

Met ?he Thr 
-25 

AGC ACC GGC TCC AGT GGG CTC TAC AAG GCG CCT C7G TCG ?AG .^GC CTT 106 
Ser Thr Gly Ser Ser Gly Leu Tyr Lys A!?. Pro Leu Ser Lys Ser Leu 
-20 ~15 -10 

CTG CTG GTC CCC AGT RCC CTC TCC CTC CT3 CSC GCC CAG 145 
Leu Leu Val Pro Ser Xaa Leu Ser Leu Leu Xaa Aia Gin 
-5 1 5 



(2) i::cOR!-lATION FOR 3ZQ ID KG: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRAMDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

MOLECULE TYPE: CDHA 



;vi) c?.:ginal source: 
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- (A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ixl FEATURE: 

(A) NAME/KEY: other 

(B> LOCATION: 137., 291 

(C) IDENTIFICATION METHOD: blastn 

{D) OTHER INFORMATION: identity 99 

region 13S. .292 
id AA121372 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 5.. 91 

(C) IDENTIFICATION METHOD: oiastn 

(D) OTHER INFORMATION: identity 97 

region 1 . , 86 

id'AA121372 

est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 318.. 397 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 322. .401 
id AA121372 
est 



(ix> FEATURE: 

(A) NAME/ KEY: other 

(B) LOCATION: 95.. 132 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 94 . . 131 
id AA121372 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 284. .313 

(C) IDENTIFICATION METHOO: blditn 

(D) OTHER INFORMATION: identity 90 

region 235. . 315 
Id PAi2L372 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) L0C.2,TI0N: 2.. 102 

(C) IDENTIFICATION METHOD: oi^stn 

(D) OTHER INFORMATION: identity 98 

reel on 15. . 115 
id'T53974 



:ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCA.TICrf: 150.. 2 53 

(Ci IDEMTIFICATION MlT.-iOi' . i-lasin 
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O) 0?HiR INFORMATION: identity 92 

region 167 . . 275 
id T53974 
est 



(ix) FEATURE! 

(A) NAME/KEY: other 

(B) LOCATION: 95.. 171 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 111. .187 
id T53974 
est 



(ix) FEATURE: 

(A) NAME/KEY: Other 

(B) LOCATION: 2. .102 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 98 

region 15 . . 115 
id R09314 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 95.. 171 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 111. .187 
id R09314 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 150.. 222 

{C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 90 

region 167. .239 
id'R05314 



(Ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 179. .298 

(C) IDENTIFICATION METHOD: Vop. Heijne matrix 

(D) OTHER INFORMATION: score 4.3 

seq ITLVSAAPGKVIC/EM 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 



AAAATCGCGG ACCACCGGGG CTGCCAKCTC GCCTCACTCC CGGCCTCTTG CGCTCCTAGG fiO 

GGCGGAGAAG GGTGCGGGCT CTTCGCCCTT TGTGTCCTTC TTTCACT.2J5.C TTCTGGACTT 

TCCAGCTCTT CCGAAGTTCG TTCTTGCGC.=i .^AGCCCAAAG GCTGGAAA.JVC CGTCCACG 

ATG A.CC AGC ATG ACT CAG TCT CTC, CGC. r,AG Z7G ATA A-C.G GCC ATG ACC 
<^»r "^hr Ser Met Thr Gin Ser Leu Arg GIu Val lie Lys Ala Met Thr 
-4C -35 -30 -25 



L20 
178 
226 
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AAG GC7 CQQ AAT T7T GAG AGA GTT TTG GGA AAG ATT ACT CTT GTC TCT 27 4 
Lys Ala Arg Asn ?he Giu Arg Val Leu Gly Lys He Thr Leu Val Ser 
-20 -15 -10 

GCT GCT CCT GGG AAA GTG ATT TGT GAA. ATG AAA GTA GAA GPJk GAG CAT 322 
Ala Ala Pro Gly Lys Val lie Cys Glu Me; Lys Val Giu Glu Glu His 
-5 1 5 

ACC A.=iT GCA ATA GGC ACT CTC CAC GGC GGT TTG ACA GCC ACG TTA GTA 370 
Thr Asn Ala He Gly Thr Leu His Gly Gly Leu Thr Ala Thr Leu Val 
10 15 20 

GAT AAC ATA TCA ACA ATG GCT CTG CTA TGC ACG GAA AGG GGA GCA CCC 4 18 
Asp Asn He Ser Thr Met Ala Leu Leu Cys Thr Giu Arg Gly Ala Pro 
25 30 35 40 

GGA GTC ACT 4 27 

Giy Val Ser 



(2) INTORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 2.. 28 5 

(C) IDENTIFICATION METHOD: Dlastn 

(D) OTHER INFORMATION: identity 98 

recion 3 . . 291 
id'T31110 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 278.. 331 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 285. . 333 
id 731110 



(ix) FEATURE: 

ik) NAME/KEY: Other 

(5) LOCATION: 2,. 329 

(C) IDENTIFICATION i-iETHOO: oiastn 

[0) OTHER I^^FOR^LATI0^:: identity 98 

region 6 . . 333 
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(ix) t£ATJRE: 

(AJ NAME/KEY: other 
(3) LOCATION: 2.. 329 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identify 98 

region 7 . . 334 
id T35807 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 9.. 331 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 93 

region 1 . . 323 
id T33763 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 15.. 331 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 99 

region 1 . . 317 
id AA132848 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) L<X:ATI0N: 75. -293 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.3 

seq DIILSGLVPGSTT/LH 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 



AAGCG.^GCCC AGGCGGCAGT CTTGATTCCC TTTTGGCCAG CAGTTTTT.AG GTCTGTCAGT 60 ' 



ACTGCACTGC AAGA ATG GCA GAT TIT GGG ATC TCT GCT GGC CAG TTT GTG 110 
Met Ala Asp Phe Gly lie Ser Ala Gly Gin Phe Vai 
-70 -65 

GCA G7S GTC TGG GAT AAG TCA TCC CCA GTG GAG GCT CTG PJ^SK GGT CTG 158 
Aid Vai Vdl Trp Asp Lys Ser Ser Pio Vdl Glu Ala Leu Lys Gly Leu 
-60 -55 -50 

GTG GAT AAG CTT CAA GCG TTA ACC GGC ?JKT GAG GGC CGC GTG TCT GTG 206 
Val Asp Lys Leu Gin Ala Leu Thr Gly Asn Glu Gly Arg Vai Ser Vai 
-45 -40 -35 -30 

GAA. AAC ATC AAG CAG CTG TTG CA.a TCT GCC CAC AAA G.^A TCC AGC BTT 254 
Glu Asn He Lys Gin Leu Leu Gin Ser Ala His Lys Glu Ser Ser Xaa 
-25 -20 -15 

GAC ATT ATT TTG TCA GGT TTA GTC CCA GGA ?iGC ACC ACT CTG CAC AGT 302 
Asc :i9 He Lau Ser Gly Leu Val :;ro Gly Ser The Thr Leu His Ser 
-10 -5 1 



3CT GAG ATT TTG GCT G.AA ATC QZZ 'GG G'Z 
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Ala Glu lie Leu Aid Glu He Ala Arq Vai 
5 10 



{?.) rNFORMATTON FOR SEQ tD NO: 254: 

ii) SZQUZtKZ CHARACTERISTICS: 

(A) LEJJGTH: 131 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

|vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 36. . 123 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 13 . . i05 
id AA115592 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION; 84.. 125 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.2 

seq GILLGLLLLGHLT/VR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 



Aj\CAGACGCT GGCGGCCACC AGAAGTTTGA GCCTCTT7GG TAGCAGGAGG CTGGAAGAAA 60 

GGACAGAAGT AGCTCTGGCT GTG ,ATG GGG ATC TTA CTG GGC CTG CTA CTC CTG 113 

Met Giy lie Leu Leu Giy Leu Leu Leu Leu 
-10 -5 



;G C.^C CTA AC A GTG AGA 131 
Giy His Leu Thr Vai Arg 
1 



[2) rMFOR>LATI0:; FOR SEQ ID NO: 235: 

(!) 5SQU£>:CE CKAR.^\CTERISTIC3: 

(A) LENGTH: 486 base pairs 
(3: TYDE: NUCLEIC ACID 
{Ci STRANDEDNESS: D0U5LE 
(D) TOPOLOGY: LINEAR 

.li) MOLECULE TYPE: COM/-. 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A} NAME/KEY: other 

(3: LOCATION: 13. .53 

(C) ICENTIFICATION METHOD: blastn 

(OJ OTHER INFORMATION: identity 9C 

region L. . 41 

id AA363d60 

est 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 55..L11 

(C) IDENTIFICATION METHOD: Von Heijns matrix 

(D) OTHER INFORMATION: score 4.1 

seq LLLGORCSLKVSG/OE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 



AAATCTTCAG GGCAGCTCCC AGAGCATGGA TCCCTCCTGA TTCCACTCAC CCCG ATG 57 

TTC CTC AC A GTC AAG CTG CTC CTG GGC CAG AGA TGC AGT CTG AAG GTG 105 
?he Leu Thr Vai Lys Leu Leu Leu Gly Gin Arg Cys Ser Leu Lys Val 
-15 -10 -5 

TCA GGG CAA GAG AGT GTA GCC ACG CTG AAG AGA CTG GTG TCC AGG CGG 153 
Ser Gly Gin Glu Ser Vai Ala Thr Leu Lys Arg Leu Vai Ser Arg Arg 
15 10 

CTG AAG GTG CZT GAG GAG CAG CAG CAC CTG CTT TTC CGT GGC CAG CTC 201 
Leu Lys Vsi Pro Glu Giu Gin Gin His Leu Leu Phe Arg Gly Gin Leu 
15 20 25 30 

CTG GAG GAT GAC A-AG CAC CTC TCT GkZ TAG TGC ATT GGG CCC AAT GCC 24 9 
Leu Glu Asp Asp Lys His Leu Ser As? Tyr Cys lie Giy Fro Asn Ala 
35 <0 45 

TCT ATC A-AT GTC ATC ATG CAG CCC TTG GAG AAG ATG GCG CTA AAG GAG 237 
Ser lie Asn Vai lie Met Gin Pro Leu Glu Lys MeL Ala Leu Lys Giu 

50 55 60 

GCC CAC CAG CCG CAG ACC CAG CCC CTG TJG CAC CAG CTG GGA CTG GTC 34 5 
Aia His Gin Pro Gin Thr Gin Pro Leu Trp His Gin Leu Gly Leu Val 
63 70 75 

CTA GCT AAA CAC TTT GAA CCA CAG GAT GCC AAG GCC GTG CTG CAG CTG jiJ3 
Leu Aia Lys His Phe Glu Pro Gin Asp Ala Lys Aia Vai Leu Gin Leu 
80 85 90 

CTA AGG CAG GAG CAC GAR CAG CCC CTG CAG AAG ATA AGC CTG GAG CAC ^;41 
Leu Arg Cir. Clu His Giu Glu /.rg L-u Gin Lys lie Ser Leu Giu His 
95 100 105 110 

CTG GAG CAG TTG GCC CAG TAC CTC CTG CCA GAG GAG CTC ACG TGG 436 
Leu Glu Glr. lau Ala Gin Tyr L--mi Lau Aia Giu Glu Leu Thr Trp 
115 120 125 
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{2) INFORMATION TOR 3EQ ID NO: 256: 

(i) SEQUENCE. CHARACTERrSTICS: 

(A: LENGTH: 411 base pairs 

(3: TYPE: NUCLEIC AC:D ' 

(C; S7RANDEDNESS : DOUBLE 

CD) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: ccmpiement ( 195 . . 411 ) 

(C) IDENTIFICATION METHOD: blascn 

(D) OTHER INFORMATION: identity 99 

region 86. .302 
id AA062591 
est 

9 

(ix) FEATURE: 

(A) NAME/KSY: sig^peptide 

(3) LOCATION: 94.. 189 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 4.1 

seq RLLSSLLLTMSNN/NP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 



GGCGACGCCG CCATTTTGGA GTCTTCCCTA .li.GGATCCTCT ACCGGCTTTT CGAGTCAGTG 60 

CTGCCGCCGC T3CCCGCGGC TTTGC.AGAGC AGG ATG AAT GTG .=.TA GAC CAC GTG 114 

Met Asn Val lie Asp His Vai 
-30 

CGG GAC ATG ZZQ GCC GCG GGG CTG CAC TCC PJKC GTG CGG CTC CTC AGC 162 
.^rg Asp Mec Ala Ala Aia Gly Leu Wis 5er Asn Val Arg Leu Leu Scr 
-25 -20 -15 -10 

AGC TTG TTA CTT ACA ATG AGT A,AT AAC CCT GAG TTA TTC TCC CCA 210 

Ser Leu Leu Leu Thr Met Ser .Asn .Asn Asn Pro Glu Leu Phe Ser Pro 
-5 1 5 

CCT CAG AAG TAG CAG CTT TTG GTG TAT CAT GCA GAT TCT CTC TTT CAT 2 58 
Pro Gin Lys Tyr Gin Leu Le-; Val Tyr His Aia Asp Ser Leu Phe His 
10 10 20 

G.AT A.AG G;^.A ?AT CGG .J^-AT GCT GTG AGT AAG TAT ACC ATG GCT TTA CAG 306 
Aso Lvs Gic Tvr Arg Asn Ai-r Ser Lvs Tyr Thr Met Ala \.ft\i Gin 

25 ' 31 ' 35 

CAG AAG AA:* '^ZG CTA AGT AA-. ^-'^^ T7A AAA GTr, AG A CCT TCA ACT GGA 354 
Glr. Lys Lys A.a Leu Ser Ly - T-- .%er Lys Val Arg t»rc Ser Tr.r Gly 
40 45 50 55 
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AAT TCT GCA TCT ACT CCA CAA ACT CAG 
Asn 3er Ala 3sr Thr Pro Gin Ser Gin 
60 

GTG AAA TAG 
Val Lys Tyr 
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TG? C7T CCA TCT GAA ATT GAA 4 02 
Cys Leu Pro Ser Giu lie Giu 
65 70 

41 • 



(2 J INFORMATION FOR SZQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

:A) LENGTH: 232 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNE3S: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other ^i. 
(3) LOCATION: complement ( 18 4 228 ) »" 
CO IDENTIFICATION METHOD: biastn 
(D) OTHER INFORMATION: identity 100 

region 99. . 14 3 

id'AA122158 

est 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 56.. 173 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFOW-LATION: score 4.1 

seq RVLCPLLXAAAAP/KR 

(y.i) SciQUENCE DESCRIPTION: SEQ ID NO: 257: 



AAGTAGCTCT CTAGGCCTGG GKRCCGGAG3 GAGGGAGGCG GGCAGAGKWG GGGAG ATG 58 

Met 

GGC ACC CCC AGT CTT TCC ATC CTC CTC ATA GGG GCA CCC GAA TCC CCT 106 

Gly Thr Pro Ser Leu Ser lie Leu Leu He Gly Ala Pro Glu Ser Pro 

-40 -35 -30 -25 

ATT CCT TAT TTC CCC TAT CAC TCA GGC ACT GGC AGG GTC CTT TGC CCA 154 

lie Pro Tyr Phe Pro Tyr His Ser Gly Thr Gly Arg Val Leu Cys Pro 

-20 -15 -10 

CTC CTG T/JG GCC GCT GCG GCT CCA AAG CGA GAT GTG CCT GAG ACA GGT 202 

Leu Leu Xaa Ala Ala Ala Ala :=ro lvs Arg Asp Val Pro Glu Thr Gly 

-3 * 1 5 

TTG ACC ACC CA.^ CTC /vV. CAT CIT GGG 232 
Leu Thr Arg Gin Leu Lys Ar: His ?rc Gly 
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(2) INFORh!ATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 216 basa pairs 

(B) TYPE: NUCLEIC ACID 

(C) STR.^NDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 28. .211 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 149. .332 
id H1507S 
esc 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 28.. 139 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 98 

region 147. ,258 
id R18367 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 138.. 179 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 92 

region 258. .299 
id Ri83o7 
est 



(i:<) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 4 6.. 123 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

seq HALFVLCLLY.WS/HN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 



APATAATTGA TTCCCTGTGT CTGGAATACC TGACCCTTCC TGGAT ATG GTG TAC CAC 57 

Mat Vai Tyr His 
-25 

GCG CTC CAC ACC CCG GAT GAT GAT TAC CAT GCC CTC- TTC GTG CTC TGC 105 
Ala Leu hsp Ssr Pro Aso Asc Tyr His Ma Leu Phe Val Leu Cys 
-20 -15 -10 
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Leu Leu Tyr Aia Met Ser His Asn Lys 
-5 1 

GAG CGA ATC CAG CTC CCC GTG CCA AAT 
Giu Arg He Glr. Leu Pro Wil Prn Asn 
15 

AAC CAC CCG CAT GGG 
Asn His Pro His Gly 
30 
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GGC ATG GAT CCT GA.^^ AJ^ TTA 153 
Gly Met Asp Pro Glu Lys Leu 
5 ' 10 

GCG GCC GAG AAG ACC ACC TAC 201 
Ma Ala Glu Lys Thr Thr Tyr 
20 25 

216 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 base pairs 

(3) TYPE: NUC:-EIC ACID 

(C) STRANDEONESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: complement {2 103) 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 98 

region 148. ,249 
id HS379F042 
est 

{i:<) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: 32.. '?3 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 
(0) OTHER IMTORJ'LJITION: score 3.S 

seq FIVLSMIVLCCGFE/IL 

(y.i) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 



.AACTTGGTGA CTCTAGGTGA CTGGTCGACA G ATG TTC ATT GTA CTA TCA ATG 52 

Met Piitt lie Val Leu Ser Met 
-10 

TGG CTT T3C TGT GGG TTT GAA ATT TTG CAA ACT AAG .AGT TGG GTG GCA 100 
Trp Leu Cvs Cvs Gly Pne Glu He Leu Gin Thr Lys Ser Trp Val Aia 
-5 * 1 5 . 



GGG 
Gly 



103 
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(i) SEOUEMCE CHARAC7Eii:STIC3 : 

lA) LENGTH: 351 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) STRANDEDNESS : DOUBLE 
(Di TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: COMA 

(vi) ORIGINTVL SOURCE: 

(A; ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(i:<) FEATURE: 

(A) NAME/KEY: Other 
(3) LOCATION: complement (184 281 ) 
(C) IDENTIFICATION METHOD: biastn 
(0) OTHER INFORMATION: identity 93 

region 2 . . 99 
id T07232 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: complement ( 103 170) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 95 

region 113. .180 
id 707232 
est 

(ix) FEATURE: 

(A) NAME/KEY: Other 
(3) LOCATION: 42.. 106 

(C) IDENTIFICATIOM METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 20. .34 
Id A.A0991i7 
eye 

(i:<) FEATURE: 

(A) N.At>IS/KSY: sig peptide 
(3) LOCATION: 280T.324 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: score 3.0 

seq VVIL3SXVPLAAM/GV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 



AC.i-:vCCCCCC AA.AATCCCGC ATTCCTCGGG GCCACGCGCT GGGA.=.GCCTG CA-^CCG3CCC 60 

CAC.AA.ATTCT AGCAGTGCCA AGAAGAAGGA TAAAAGAGTT CAAGGTAAGC .AGTGTCAGGA 120 

TCTCTTT.^u".G GA.ACATG3TT TTCTTCTTTC ATT.ACGTGCT TTTGGAGGAA GA-AAAA.AACA 180 

GGCCAGAGAA GGGGGCCTGT GGCTTTACTT CCTTGTAGTC ACACCTGTGG GGATTCTGGG 24 0 

TCTGGCCATC CCAGCCCTG3 JiGCGAGGGCT GTGTCAGGA ATG GTG GTG GTT ATT 294 

Men V-i : tf ^ I V^: lie 
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-15 

TTG AGC AG? GYA GTT CCC TTG CCA GCC ATG GGG GTC ATG GGC TGT GTC 342 
Leu Ser Ser Xaa Val Pro Leu Ala Ala Met Gly Val Met Giy Cys Vai 
-10 -5 1 5 

GGG GTG TGG 351 
Arg Val Trp 



(2) INFORMATION FOR SSQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

:A) LENGTH: 201 base pairs 

:B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

[A) ORGANISM: Homo Sapiens 
{F) TISSUE TYPE: Ovary 

[Lk) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 16.. 62 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 463. .509 
id AAC69619 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptids 
(3) LOCATION: l..?5 

(C) IDENTIFICATION METHOD: Von Heijne matrix. 

(D) OTHER IHFOPJMATION: score 3.8 

seq AECSSLLHPSVRG/SI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 



ATG TTG GCA GAA TGC AGT TCC TTA CTG CAT CCA TCA GTT AGA GGC TCG 48 
Met Leu Ala Glu Cys Ser Ser Leu Leu His Pro Ser Val Arg Gly Ser 
-15 -10 -5 1 

ATC CCA GAG GCC ACC TGC CGT GTC CTG CCA TGT GGC CCT CTC C.AC .AAC 96 
lie ?ro Glu Ala Thr Cys Arg Val Leu Pro Cys Gly Pro Leu His Asn 
5 10 15 

ATG GCA GTT TGC TCT TCC AAG CCT AGC AGG AGC TTC TAC TGC AAC TTC 14 4 
Met Ala Val Cvs Ser Cv.-; Lvs Ala Ser Arg Ser Phe Tyr Cys Asn Phe 
20 * 25 30 

AGA TCT CTC CGA CTT GT? GTC TCT GAC TTC TTG .ATT CTT TTC C.-A AAG 192 
Arg 33r Leu Arq Leu Aid Val Ser Asp P.Se Leu He Leu Phe Gin Lys 



^-D ^.i; 4 5 



201 
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Giy Leu Gly 
50 



•2) TMFGRMATION FOR 3EQ ID MO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 6 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STPANDEDN'ESS : DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: Other 
(3) LOCATION: 76.. 141 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 90 

region 50, . 115 
id R25850 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 39.. 141 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 49. . 101 
id N44651 
est 

(ix) FEATURE: 

(A) NAME/KEi: other 

(B) LOCATION: 3 5.. 141 

(C) IDENTIFICATION METHOD: biastn 
(0) OTHER 1NF0P.MATI0N: identity 91 

region 38 ..94 
id N31513 
est 

(ix) FEATURE: 

(A) NAME/KEY: siq^peptide 
(3) LOCATION: 54.798 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) . OTHER INFORMATION: score 3.5 

seq M-fiRLLGLCAWARK/SV 

Cri) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 
AACTCTGTCA CCTCCGCTC-G A^.GGAGTGGA ACCCAGACTT GCTGGTCTGA TCC ATG 



56 
-15 
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Gin Met Ala. Arg Leu Leu Gly Leu Cys 
-10 

ZGG ATG GCC AGC TCC AGG ATG ACC CGC 

Arg Met Aia Ser Ser Arg Met Thr Arg 

5 10 
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228 

Aia Trp Aia Arg Lys Ser Vai 
-5 

CGG GAG CCG CCA AGG 146 • 

Arg Asp Pro Pro Arg 
15 



l2) INFORMATION FOR 3EQ ID NO: 263: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 base pairs 
(3) TYPE: NUCLEIC ACID 
(C) STRANDECNESS: DOUBLE 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CONA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(i:<) FEATURE: 

(A) NAME/KEY: Other 

(3) LOCATIOn: compiement ( 4 4 . . 83) 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 97 

region 313. .352 
id R56475 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: corapiement ( 7 3 . . 226) 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 95 

region 136 . .289 
id TC5392 
est 

{i;<) FEATUPJ:: 

(A) NAME/KEY: other 

(B) LOCATION: complement (7 3 .. 226) 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 93 

region 161 . . 314 
id HUM030E12A 
est 

(ix) FEATURE: 

(A) MAtME/KEY: other 
(3; LOCATION: complement ( 72 .. 225 ) 
(C: IDi::7IFICATI0N METHOD: biastn 
(0) OTHER INF0Ri4ATI0N: identity 92 

region 161. . 315 
id HUM016H07A 



(i:'.) FEATUP.E: 
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(B) LOCATION: complemant (1 3 1 . . 226 ) 

(C) IDZNTIt ICATION METHOD: blastn 

(D) OTHER INFORMATION*: idenCiCy 9"^ 

region 168 . .213 
id H08767 
est 

(ix) FEATURE: 

(A) NAME/KEY: orher 

(B) LOCATION: compiemeat { 39 . . 7 7 ) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 92 

region 326. . 364 
id H03767 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 91.. 219 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.3 

seq LISVLYLIPKTLT/TN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 



A.ACAAAAGGA GAGTTTTATA ATTCACTTTA AAAGG.AG.ATT TGATGGT.AAA GTTTAAAGAT 60 

TAAAATATTT TGTTCT7CAA TTACAGAGCG ATG ACC CCA CAG TAT CTG OCT CAC 114 

Met Thr Pro Gin Tyr Leu Pro His 
-40 

GGT GGA AAA TAC CAA GTT CTT GGA GAT TAG TCT TTG GCA GTG GTC TTC 162 
Gly Gly Lys Tyr Gin Val Leu Gly Asp Tyr Ser Leu Ala Val Val Phe 
-35 -30 -25 -20 

CCC CTG CAC TTT TCT CAT CTA ATT TCT GTT TTA TAC CTT ATA CCC AAA 210 
3ro Le-j His Pr.e Ser Aso Leu lie Ser Vai Leu Tyr Leu lie Pro Lys 
-15 ' -10 -5 

ACA CTT ACT ACC A.AC AGC CGG 
Tnr Leu* Thr Tnr Asn Ser Arg 
1 



(2) IMFORt^TION FOR SEQ ID NO: 264: 

(i) SEQUE?:CE CHARACTERISTICS: 

lA) LENGTH: 361 base pairs 
{3/ TYPE: NUCLEIC ACID 
id STPANDEDNESS: DOUBLE 
(D) TOPOLOGY: LINEAR 

'ii: MCLE^'JLE TYPE: CDNA 

(vii 0RIGII:AL SOURCE: 

{AJ r:r<::;ANISM : Homo sapiens 

ii) rrnnwi: TY?E: spleen 
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(A) NAME/KEY: other 
*■ (Bl LOCATION: 53. .342 

(C) IDENTIFICATION METHOD: blastn 
(0) OTHER INFORMATION: identity 96 

region 19. . 309 

ic C1S012 

esc 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 123.. 34 9 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 112 . . 338 
id .AA058608 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 22.. 83 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 91 

region 12. . 73 
id AA058608 

est > 

»' 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 103.. 331 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 92 

region 87 . .315 
id N42Q02 
est 

(ixi FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 19.. 67 

(C) IDENTIFICATION MHITHGO: blastn 

(D) OTHER INFORMATION: identify 97 

region 1.-69 
id R13567 
esc 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 139.. 361 

•;C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identi-y 100 

region 25 . .247 

id .^-A1510C3 

9s»: 

(ix) FE.-.TURE: 

(A) NAME/KEY: sig^pepcida 
(3; LOCATION: 17.. 8 5 

C:) IDENTIFICATION METHOD: Von Heijne matrix 
'Z) OTHER INFORMATION: score 3.7 

sec Fl?PLXRA"ACRG/CQ 
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hAGZGGGCGT GGGGCC ATG GTG GTC TTG CGG GCG GGG APiG AAG ACC TTT CTC 52 
Met Vai Val Leu Arg Aia Gly Lys Lys Thi Phe Leu 
-20 -15 

CCC CCT CTK WGC CGC GCC TTC GCC TGC CGC GGC TGT.CAA CTC GCT CCG 100 
Pre Pro Leu Xaa Arg Ala Phe Aia Cvs Arg Gly Cys Gin Lej Ala Pro 
-ID -5 i 5 

GAG CGC GGC GCC GAG CGC AGG GAT ACA GCG CCC AGC GGG GTC TCA AGA 14 8 
Giu Arg Gly Aia Giu Arg Arg Asp Thr Aia Pro Ser Gly Vai Ser Arg 
10 15 20 

TTC TGC CC7 CCA AGA AAG TCT TGC CAT GAT TGG ATA GGA CCC CCA GAT 196 
Phe Cys Pro Pro Arg Lys Ser Cys Kis Asp Tr? lie Giy Pro Pro Asp 
25 30 35 

AAA TAT TCA AAC CTT CGA CCT GTT CAC TTT TAC ATA CCT GAA A^^.T GAA 24 4 
Lys Tyr Ser Asn Leu Arg Pro Val His Phe Tyr lie Pro Giu Asa Giu 
40 45 50 

TCT CCA TTG GAA CAA AAG CTT AGA AAA TTA AGA CAA GAA ACA C?.\ GAA 292 
Ser Pro Leu Giu Gin Lys Leu Arg Lys Leu Arg Gin Giu Thr Gin Giu 
55 60 65 

TGG A^T CAA CAG TTC TGG GCA AAC CAG AAT TTG ACT TTT ACT A^.G GAA 34 0 
Trp Asn Gin Gin Phe Trp Aia Asn Gin Asn Leu Thr Phe Ser Lys Giu 
70 75 80 95 

AAA GAA GAA TTT ATT CAC TCA 
Lys Giu Giu Phe lie His Ser 
90 



(2) IMTORt-lATION FOR S2Q ID NO: 265: 

(i) SEQUENCE CHAE^ACTERISTICS : 

(A) LENGTH: 113 base pairs 

(B) TYPS: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: ii..ll3 

(C) IDENTIFICATION METHOD: biastn 

;D) OTHER INFORMATION: identity 99 

region 2 . . 104 
id N76875 
est 

i-A) FEATURE: 

:.-.) MAME/KEY: sig^peptide 
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(B) LOCATION: 15.. 74 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.6 

seq AHLCSDSLPESQQ/QD 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 265: 



AAGAG.^GAAC CGCC ATG AAG AGA GAA GGG GGT GCC GCC CAC CTC TGC TCC 53 
Met Lys Arg Glu Gly Gly Ala Ala His Leu Cys Ser 
-20 -15 

GAG AGC CTC CCG GAG TCC CAG CAG CAA GAC GGC AAC CAC GCA CCC .AAC 93 
Aso S^r Leu Pro Glu Ser Gin Gin Gin Asp Gly Asn His Ala' Pro Asn 
-5 1 5 

TTC TCC AGC CAC GGC 
Phe Ser Ser His Gly 
10 



(2) INFORMATIOM TOR SEQ ID NO: 266: 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) rE.i.TURE: 

(A) NAME/KEY: other 
(3) LOCATION: 255.. 34 3 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 93^^ 

region 12 . . 100 
id A.A026923 
est 

(ix) FEATURE; 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 205.. 327 

(C) IDENTIFICATION METHOD: Von Heijne natrix 

(D) OTHER INFORMATION: score 3.6 

sec PYSLAA.C?C:-SC)G/GV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

.-.CCGAG.aAGC CCTCACAGAT GCAGATGACT TTGGCCTACA G7TCCCGCTG C^.CCTGGATG 60 
T3AGGGTGAA GGC7GTGCTG CTGGGAGCCA CATTCCTCAT TGACT.^CATG TTCTTTGAGA 120 
.i.GCG.-.GGAGG ZZ^—ZZZZ TCTGCCATCA CCAGTT.AG.AG GCC.-.CCATGG TGTGAGG.AGA IHO 
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CCATCACCTC- -GACCAGAACT CCAG ATG G7C ACC TGC CCT GGC CCC TCC TCT 231 

Met Vdl Thi- Cys Pro Oiy Pro Ser 5er 
-4C -35 

GGG CAG CCC CTT TCC TCC ATG TAC ACT GCA GGG GAG AGA AGG GGG GCC 27 9 
Giv Gin Pro Leu Set Ser Met Tvr Thr Ala Gly Asp Arg Arg Giy Ala 
-30 -25 -20 

CCA TCC CTA CCC TAC TCC CTG GCC GCC TGC CCC TGT GGT TCC CAA GGA 327 
Pro Ser Leu Pro Tyr Ser leu Ala Ala Cys Pro Cys Gly Ser Gin Gly 
-15 ' -10 -5 

GGG GTA TGT ATG AGA 342 
Gly Val Cys Met Arg 
1 5 



(2) INFORMATION TOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNES3: DOUBLE 
fO) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: complement ( 1 300) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity ICO 

region 1 . . 300 
id H13499 
est 

(ix) FEATURE: 

(A) UmZ/KLY: Other 

(8) LOCATION: complenent ( 1 . . 2 68 ) 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORM'.TION: identity ICO 

region 1 - .263 
id W40371 
est 

(ix) FEATURE: 

(A) NA:4E/KEY: other 

(B) LOCATION: complement ( 1 93 ) 

(C) IDE^^TIFICATIO^: method: blastn 
[Z) OTHER INFOPMATTOM: identity 9^ 

region 1 . . 93 
id H04223 
esr. 

(ix) TEAiURE: 

A; M.^.ME/KEY: si7_P^?"-« 
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(B) LOCATION: 109.. 162 

(C) IDENTIFICATION KSTHOD: Vcn Heijna matrix 
{D) OTHER INFORMATION: score 3.5 

seq ALEVIVTLSETAA/AM 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

AAATCCCTCG TTGAGATTGC AGATACTGTT CCAAAGTATT TGCGTCC7CA CTTGGAAGCA 60 

ACTCTACAGC TAAGTCTAAA GTTGTGTGGA GACACTAGCC TCAACAAT ATG CAA CGC 117 

Met Gin Arg 

CAG CTT GCC CTT GAA GTG ATC GTC ACC CTC TCT GAG ACT GCA GCT GCT 165 
Gin Leu Ala Leu Glu Val lie Val Thr Leu Ser Giu Thr Ala Ala Aia 
-15 -10 -5 1 

ATG TTA AGA AAA CAT ACC AAT ATT GTT GCA CAG ACT ATT CCT CAG ATG 213 
Met Leu Arg Lvs His Thr Asn lie Val Aia Gin Thr lie Pro Gin Met 
'5 10 15 

TTA GCA ATG ATG GTT GAT TTG GAA GAA GAT GAG GAG TGG GCA AAT GCA 261 
Leu Aia Met Met Vai Asp Leu Glu Glu Asp Glu Asp Trp Aia Asn Aia 
20 25 30 

GAT GAA CTA GAA GAT GAT GAT TT7 GAC AGC PJkT GCA GTT GCA GGC GAG 309 
Asp Glu Leu Giu Asp Asp Asp Phe Asp Ser Asn Aia Vai Aia Gly Glu 
35 40 45 

AGT GCT CTA GAT CGA ATG GCT TGC GGA CTT GGT GGA AAG CTC GTT CTG 357 
Ser Aia Leu Asp Arg Met Aia Cys Gly Leu Gly Giy Lys Leu Val Leu 
50 55 60 65 

CCG ATG ATC AAG GAA CAC ATT ATG CAA ATG CTT CAA AAT CGT AAG CTG 4 05 
Pro M-t lie Lys Giu His lie Met Gin Met Leu Gin Asn Arg Lys Leu 
70 75 80 



TGT CCT TCA ATG CTA 
Cys Pro Ser Met Leu 
85 



420 



(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 base pairs 

(3) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hono Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 177.. 343 

'T; IDENTIFICATION METHOD: bl-ss-^ 
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(D) OTHER INF0RMATI0r4: identity 100 

region 266. . 43*? 
id N32722 
est 

(ix) FEIATURE: 

(A) MAM£/KEY: other 
(3) LOCATION: 52.. 175 

(C) IDENTIFICATION METHOD: blasrn 

(D) OTHER INFOI^ATION: identity 94 

region 142. .265 
id N32722 
est 



(ix) FE.ATURE; 

(A) NAME/KEY: other 
(3) LOCATION: 3.. 41 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 1 . . 39 
id N32722 
est 



(ix) FEATURE: 

(A) NAME/KEY: other i' 

(B) LOCATION: 36.. 387 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 96 

region 12. .363 
id W32042 
est 



(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 177,. 34 8 

(C) IDENTIFICATION METHOD: blastn 

(D) OTHER INFORMATION: identity 100 

region 134 . .305 
Id R55254 



(IX) FE.ATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 99.. 175 

(C) IDENTIFICATION METHOD : biaszn 

(D) OTHER INFORMATION: idcnzi^iy 100 

region 5^ . . 133 
id R55254 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 44 . . 102 

(C) IDENTIFICATION METHOD: blastn 

(01 OTHER INFORMATION: ident iry ^1 

region 1 . .5? 

id R5 5254 



(IX) FEATURE: 

lA) NAME/KEY: oiher 
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... (B) LOCATIOM: 356. .387 

(C) IDENTIFICATION K£THOD: biastn 

(D) OTHER INFORMATION: identify 50 

region 315 . . 346 

id R55254 

est 

(ix) feature;: 

(A) NAME/KEY: other 

(B) LOCATION: 177.. 334 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 100 

region 149. .306 
id W37647 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 38. . 17 5 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORMATION: identity 94 

region 11 . .148 
id W37647 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 

(B) LOCATION: 38,. 174 

(C) IDENTIFICATION METHOD: biastn 
(DJ OTHER INFORMATION: identity 95 

region 8 . . 144 
id R50622 
est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 174, .295 

(C) IDENTIFICATION METHOD: biastn 

(D) OTHER INFORM/vTION: identity 99 

region 143. .264 
id R50622 
est 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: 1477.374 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LASASELPLGSRP/AP 

(xi) SEQUENCE DESCRIPTICM: SEQ ID NO: 268: 



AACTTCCTGT GA3CCCGGCG GTGAC.a-ACGG CA.ACATGGCC CGTGAACGGA GCTGAAGTCG 60 

ACGACTTCTC CTP.GRARMCC CCGACTGAGG CGGAGACGAA GGTGCTGC.^G GCGCGACGGG 120 

A-rr^v-^-AAnA TC-CATCTCC CGGCTC A7G t^CC GAC TJKT CTG CTG CGC GGT TAG 173 

Ket Giy Asp Tyr Leu Leu Arg Gly Tyr 
-75 -70 
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CCC ATG CTG GGC GAG ACG TGT GCG GAC TGC GGG ACG ATC CTC CTC CAA 221 
Arg yiez Leu Gly Glu Thr Cys Ala Asp Cys Gly Thr lie Leu Leu Gin 
-65 ' -60 ' -55 

GAC AAA CAG CCG AAA ATC TAC TGC GTG GCT TGT CAG G.AA CTC GAC TCA 269 
Asp Lvs Gin Arg Lys lie Tyr Cvs Val Ala Cys Gin Gla Leu Asp Ser 
-50 -45 ■ -40 

GAC GTG GAT AAA GAT AAT CCC GCT CTG AAT GCC CAG GCT GCC CTC TCC 517 
Asp Val Aso Lys Asp Asn Pro Ala Leu Asn Ala Gin Ala Aia Leu Ser 
-35 ' -30 -25 -20 

CAA GCT CGG GAG CAC CAG CTG GCC TCA GCC TCA GAG CTC CCC CTG GGC 365 
Gin Ala Arg Glu His Gin Leu Aia Ser Ala Ser Glu Leu Pro Leu Gly 
-15 -10 -5 

TCT CGA CCT GCG CCC CAA CCC CAC GGG 392 
Ser Arg Pro Aia Pro Gin Pro His Gly 

1 .5 



(2) INFOW^ATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 base pairs 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 61.. 232 

(C) IDENTIFICATION METHOD: fasta 

(D) OTHER INFORMATION: identity 100 

region 1 . . 172 
id H5C1R 
vrt 

iist) FEATURE: 

(A) NAME/KEY: other 

(3) LOCATION: 72.. 232 

{C) IDENTIFICATION METHOD: fasta 

{D) OTHER INFORMATION: identity 100 

region 24 . . 134 
id HUMCIR 
vrt 

(ix) riATURE: 

;A) NAME/KEY: ochar 

(3) LOCATION: 109.. 2 32 

{O IDENTIFICA'^*''in Mr.THOC: hl;:»srn 

(J) OTHER INFG?.:-i.VTIC?;: identity 92 

region 1 . . 124 
id T7075 
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est 

(ix) FEATURE: 

(A) NAME/KEY: other 
(3) LOCATION: 98..14i 

(C) IDENTinCATION METHOD: biastn 

(D) OTHER INFORMATION: identity 93 

region 1 . . 44 
id T64778 
est 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 112.. 156 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.1 

seq LLYLLVPALFCRA/GG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 



AACTCCACAG AAAACCCTCC CCTCCCTGCT GTGCATGACG CGGGCTCCCT CTGCACACAG 60 

TGCACGAAGA CGCTGTCGGG AGAGCCCAGG ATTCAACACG GGCCTTGAGA A ATG TGG 117 

Met Trp 
-15 

CTC TTG TAC CTC CTG GTG CCG GCC CTG TTC TGC AGG GCA GGA GGC TCC 165 
Leu Leu Tyr Leu Leu Val Pro Ala Leu Phe Cys Arg Ala Gly Gly Ser 
-10 -5 1 

ATT CCC ATC CCT CAG AAG TTA TTT GGG GAG GTG ACT TCC CCT CTG TTC 213 
He Pro lie Pro Gin Lys Leu Phe Gly Glu Val Thr Ser Pro Leu Phe 
5 10 15 

CCC AAG CCT TAC CCC AAC ACG 234 
Pro Lys Pro Tyr Pro Asn Thr 
20 25 



(21 INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 302 base pairs 

(B) TYPE:. NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(i:0 CE.ATURE: 

(A) NAME/KEY: other 

(3) LOCATION: / . .300 

(C) IDENTIFICATION METHOD: fasta 

(C) OTHER INFOP.MATION: identity 93 

region I . .214 
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id HSCALICIN 
vrt 

(ix) FEATURE: 

(A) MAME/KEY: sig peptide 
(3) . LOCATION: 78.7251 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4 

seq L.i^V3PLVRSLIS/SN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 



AAATCTGATC CCACAGGCCT GAGAAAGTCT GCTCTCCAGW ACCTGCTGCT GATCTGTT7C 60 

AGCCGACAAG AGGCACC ATG AAA TTG GAA TTC ACG GAG AAA AAC BAC RAT 110 
Met Lys Leu Glu Phe Thr Glu Lys Asn Xaa Xaa 
-55 -50 

AGC TTC GTG CTG CAR AAC CTG AAC AG A CAG AGG AAA CGC AAA GAG TAC 158 
Ser Phe Val Leu Gin Asn Leu Asn Arg Gin Arg Lys Arg Lys Glu Tyr 
-45 -40 -35 

TGG GAC ATG GCC CTG ACT GTG GAC AAC CAC GTC TTC TTT GCA CAT CGC 206 
Trp Asp Met Ala Leu Ser Val Asp Asn His Val Phe Phe Ala His Arg 
-30 -25 -20 

AAT GTG CTG GCT GCT GTC TCC CCA CTG GTG AGG AGC CTC ATC TCC AGC 254 
Asn Val Leu Ala Ala Val Ser Pro Leu Val Arg Ser Leu He Ser Ser 
-15 -10 -5 1 

AAT GAC ATG AAG ACC GCT GAT GAG CTT TTC ATC ACC ATT GAC ACC AAG 302 
Asn Asp Met Lys Thr Ala Asp Glu Leu Phe He Thr He Asp Thr Lys 
5 10 15 



(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY; LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -30.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
;D) other INFORMATION: score 13.2 

seq LLLLSTLVIPSAA/AP 

(xi) SEQUENCE DESCRIPTIOti: SZQ ID NO: 271: 



Met Gly Glr.i Aia Ser Pro tm AI^ Pro Ala Arg Arg His Leu Leu Val 
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-30 -25 -20 -15 

Leu Leu Leu Leu Leu Ser Thr Leu Vai He ?ro Ser Ala Ala Aia Pro 
-10 -5 L 

He His Asp Ala Asp Ala Gin Giu Ser Ser Leu Gly Leu Thr Gly Leu 
5 10 15 

Gin Ser Leu Leu Gin Gly Phe Ser Arg Leu Phe Leu Lys Gly Asn Leu 
20 25 30 

Leu Arg Gly He Asp Ser Leu Phe Ser Ala Pro Met Asp Phe Arg Gly 
35 40 45 50 

Leu Pro Gly Asn Tyr His Lys Glu Glu Asn Gin Glu His Gin Leu Gly 
55 ■ 60 65 

Asn Asn Thr Leu Ser Ser Xaa Leu Gin He Asp Xaa Met Thr Asp Asn 
70 75 80 

Lys Thr Gly Glu Val Leu He Ser Glu Asn Val Val Ala 
85 90 95 



(2) INFORMATION FOR SSQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix> FEATURE: 

(A) NAME/KEi: siq^pepCicit; 

(B) LOCATION: -21.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 12 

seq VLVLCVLLLQAQG/GY 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 



Met Ala Pro Gin Thr Leu Leu Pro Val Leu Vai Leu Cys Val Leu Leu 

-20 -15 -10 

Leu Gin Ala Gin Gly Gly Tyr .A.rg Aso Lys Me:: Arg Met Gin Arg He 

-5 1 5 10 

Lys Val Cys Glu Lys Arg Pro Ser He .^.sp Leu Cys He His His Arg 

15 20 25 



(2) INFORMATION FOR SSQ TD NO: 273: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ICi ammo acids 

(B) TYPE: AMINO ACID 
"(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

[vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -19.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 11 

seq SLVLLLCLTCSYA/FM 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 



Met Trp Thr Leu Lys Ser Ser Leu 
-15 

Ser Tyr Ala Phe Met Phe Ser Ser 
1 5 

Gin Gly Lys Val Gin Tyr Gly Glu 
15 20 

Pro Glu His Thr Gin Gly Trp Leu 
30 35 

Xaa Val Val Val Pro Phe Val lie 
50 

Lys Asn Lys Glu Gin Ser Pro Pro 
65 

Ser Pro Leu Lys Lys 
80 



Val Leu Leu Leu Cys Leu Thr Cys 
-10 -5 

Leu Arg Gin Lys Thr Ser Glu Pro 
10 

His Phe Arg lie Arg Gin Asn Leu 
25 

Gly Ser Lys Trp Leu Trp Leu Leu 
40 45 

Leu Gin Cys Gin Arg Asp Ser Glu 
55 60 

Gly Leu Arg Gly Gly Gin Leu His 
70 75 



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQOENCE CHARACTERISTICS: 

<A) LENGTH: 115 amino acids 
(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/'KcY: sig_pepcide 

(B) LOCATIOtJ: -167.-1 

(C) IDENTIFICATION METHOD: Von Heijne matri:'. 
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(D) OTHER INFORMATION: score 10.6 

seq LLLLPLLWGGSLQ/EK 

(xi) SEQUENCE DESCRIPTIOM: SEQ ID NO: 274: 



Met Leu Pro -Leu Leu Leu Leu Pro Leu Leu 7rp Gly Gly Ser Leu Gin 
-15 -10 -5 

Glu Lys Pro Vai Tyr Giu Leu Gin Val Gin Lys Ser Vai Thr Vai Gin 
1 5 10 15 

Glu Gly Leu Cys Val Leu Vai Pro Cys Ser Phe Ser Tyr Pro Trp Arg 
20 25 30 

Ser Trp Tyr Ser Ser Pro Pro Leu Tyr Vai Tyr Trp Phe Arg Asp Gly 

35 40 45 

Glu lie Pro Tyr Tyr Ala Giu Vai Val Ala Thr Asn Asn Pro Asp Arg 
50 55 60 

Arg Val Lys Pro Glu Thr Gin Gly Arg Phe Arg Leu Leu Gly Asp Val 
65 70 75 80 

Gin Lys Lys Asn Cys Ser Leu Ser lie Gly Asp Xaa Arg Met Glu Asp 
85 90 95 

Thr Gly Gly 



(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 
(3) TYPE: AMINO ACID 
<D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -26?. -1 

(C) IDENTIFICATION METHOD: Von H-ijne matri:< 

(D) OTHER INFORMATION: scoie 10.4 

seq LLLLLCGPSQDQC/R? 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 j. 



Giu Thr Gly Ala Leu Arg Arg Pro Gin Leu Leu ?ro Leu Leu Leu 
-25 -20 -15 



Leu Lau Cys Gly Pro Scr Gin Asp Gin Cys Arg Pro Val Leu Cin Asn 
-10 -5 1 5 

Leu Leu Gin Ser Pro Ciy Leu Thr Trp Ser Leu Glu Vai Pro Thr Gly 



I 
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..10 15 20 

Arg Glu Giy Lys Glu Gly Thr Met Arg Val 3er Pro Thr Ala Pro Arg 
25 30 35 



(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -16.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 10.1 

seq LVLTLCTLPLAVA/SA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 



Met Glu Arg Leu Val Leu Thr Leu 
-15 -10 

Ser Ala Gly Cys Ala Thr Thr Pro 
1 5 

Cys ?he Lys Vai Ser Ser Trp Thr 
20 

Pro Leu Aso Gin Val Cys lie Ser 

35 40 

Trp Ser Val Arg Val Leu Leu Ser 
50 55 

Asn Ser Gly 
65 



Cys Thr Leu Pro Leu Ala Val Ala 
-5 

Ala Arg Asn Leu Ser Cys Tyr Gin 

10 15 

Giu Cys Pro Pro Thr Trp Cys Ser 
25 30 

Asn Glu vai Val Val Ser Phe Lys 
45 

Lys Arg Cys Ala Pro Arg Cys Pro 
60 



(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 
(3) TYPE: A^:INO ACID 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL COURCE: 

(A) ORG.V.iISW: Memo Sapiens 
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(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -23 /.-I 

(C) rOENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.8 

seq FLLFFFLFLLTRG/SL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 



Met Met Leu Pro Gin Trp Leu Leu Leu Leu Phe Leu Leu Phe Phe Phe 
-20 -15 -10 

Leu Phe Leu Leu Thr Arg Giy Ser Leu Ser Pro Thr Lys Tyr Asn Leu 
-5 15- 

Leu Glu Leu Lys Glu Xaa Xaa Xaa Gly Asn Gin Asp Cys Glu Thr Giy 
10 15 20 25 

Cvs Cys Gin Arg Ala Pro Asp Asn Cys Giu Ser His Cys Ala Glu Lys 
30 35 40 

Gly Ser Glu Giy Ser Leu Cys Gin Thr Gin Val Phe Phe Gly Gin Tyr 
45 50 55 

Arg Ala Cys Pro Cys Leu Arg Asn Leu Thr Cys He Tyr Ser Lys Asn 
60 65 70 

Giu Lys Tr? Leu Ser He Ala Tyr Gly Arg Cys Gin Lys He Gly Arg 
75 ' 80 85 

Gin Lys Leu Ala Arg Lys Cys Ser 
90 95 



(2) IMFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYet:: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAI-IE/KEY: sig^peotide 

(B) LOCATIOM: -22.. -i 

(C) IDEMTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.3 

seq LVVFCLALQLVPG/S? 

l:<i) SEQUE::':::; description: seq id NO: 278: 
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Met Lys Pro Vai Leu Pro Leu Gin Xaa Leu Val Vai ?he Cys Leu Ala 
-20 -15 -10 

Leu Gin Leu Val Pro Gly Ser Pro Lys Gin Leu Gly 
-5.1 5 



(2) IMFORMATIOM FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 aniino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -35.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix i' 

(D) OTHER INFORMATION: score 9.5 

seq LFFSL?SAPLASA/VR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 



Met Phe Arg Gin Arg Gin Giu Thr Ala Gin Arg Ser Thr Gin Ser Cys 

-35 -30 -25 -20 

Arg Cys Pro Arg Asp Gly Leu Phe Phe Ser Leu Phe Ser Ala Pro Leu 

-15 -LC -5 

Ala Ser Ala Val Arg Ala Ala Xaa 

1 5 



(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 
(a) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

iA) ORGANISM: Hono Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) F£ATU?.£: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -26.. -I 

iC) IDENTIFICATION METHOD: Von Heijne matrix 
iD) OTHER IMFORi-LATION: score 9.4 

seq RLLLALPL.MVLC/FE 
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(xi) SEQUENCE DESCRIPTION': SEQ ID NO: 280: 



Met Giy Ser Ser Ala Cys GXu lie Ala Val Gly Thr Lys Arg Leu Leu 
-25 -20 -15 

Leu Ala Leu Pro Leu Ala Leu Val Leu Gly Phe Glu Gly Ser Ser Val 
-10 -5 1 5 

Pro Pro Arg Asn Phe 
10 



(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) MAME/KEY: sig^peptide 

(B) LOCATION: -23.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORRATION: score 9.4 

seq 3LLFICFFGESFC/IC 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 



Met Ser Asn Gin Arg Leu Pro Leu lie Phe Ser Leu Leu Phe lie Cys 
-20 -15 -10 

Phe Phe* Gly Giu Ser Phe Cys lie Cys Asp Gly Thr Val Trp Thr Xaa 
-5 15 

Val Xaa Trp Giu lie Leu Pro Giu Giu Vai His Tyr Trp Lys Val Lys 
10 15 20 25 

Giy Ser Pro Ser His Cys Leu Arg 
30 



(2) INFORMATIO:; FOR SEQ ID NO: 282: 

(i) SEQ-jEMCE CHARACTERISTICS; 

LENGTH: 20 amino acids 

(3) type: amino acid 
topology : linear 

(ii) MOLECvLE TYPE: PROTEIN 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -IST.-I 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.2 

seq FLSFLLALL3LNC/IP 

(xi) SEQUENCE DESCRIPTION: SEQ -ID NO: 282: 



Met Leu Trp Phe Leu Ser Phe Leu Leu Ala Leu Leu Ser Leu Asn Cys 
-15 -10 -5 

He Pro He Gly 
1 



(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptic!e 
(3) LOCATION: -24.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 9.2 

sec ICCVIVLISLSWT/SP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 



Met Leu Xaa He Ser Leu Glu He Xaa Ser ?t\e He Cys Cys Vai He 
-20 -15 -10 

Val Leu He Ser Leu Ser Trp TUr Ser Pro Phe Thr Gly Val Tyr Leu 
-5 L 5 

He Gly Leu He He Glu Pro Gly 
10 15 



(2) INFORMATION FOR SEQ ID NO: 234: 

(i) S£Q*J£NCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
•; = . TYPE: AMINO ACID 
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(D) TOPOLOGY: LINEAR 

fii) MCI-ECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Horr.o Sapiens • 
(F) TISSUE TYPE; Ovary 

fix) rSATURE: 

(A) NAME/KEY: sig^pepticie 

(B) LOCATION: -19. .-I 

(C) IDENTIFICATION METHOD: Von Heijne nacrix 

(D) OTHER INFORMATION: score 9.2 

seq ILFILTFrSHTFC/SR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 



Met Val Phe Arg Asn Cys lie Leu Phe He Leu Tnr Phe Phe Ser His 
-15 -10 -5 

Thr Phe Cys Ser Arg Gin Asn Lvs Ala Gin Pro Tro 
1 5 

i" 

(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 51 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(31 LOCATION: -137". -1 

(C) IDENTIFICATION METHOD: Von Hei^ne matrix 
{Dl OTHER INFORf-UTION: score 9.1 

seq MLAACPLSPGCQS/A? 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 



Met Leu Ala Ala Cys Pro Leu Ser Pro Giy Cys Gir. Ser Aia Pro Ser 

-10 -5 1 

T'.r Trp Asn His Phe Pro Pro Giu Arg lie Thr Thr Glv Ala Giy Ser 

5 10 15 ' 



Leii Leu Lys Pro Giy Giy Giy Leu Tro Pro Arg Thr Val Ser Leu Pro 
20 25 ' 30 35 



vr ?ro Aia 
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12) INFORMATION FOR SEQ ID NO: 2S6: 

(-) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 76 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(B') LOCATION: -19.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.1 

seq FLTLITHCTVSWA/QS 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 



Met Aia Trp Ser Pro Leu Phe Leu 
-15 

Ser Trp Aia Gin Ser Val Leu Thr 
1 5 

Pro Arg Gin Arg Val Thr lie Ser 
15 20 

Giy Arg Asn Ala Val Asn Tro Tyr 
30 35 

Arg Leu Leu lie P.^xe Tyr Asn Asn 
50 



Thr Leu lie Thr His Cys Thr Val 
-10 -5 

Gin Pro Pro Ser Val Ser Glu Aia 
10 

Cys Phe Gly Ser Ser Ser Asn lie 

25 

Gin Gin Leu Pro Giy Arg Ser Pro 

40 45 

Leu Pro Ala Ser 
55 



(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A} ORGANISM: Hcr?.o Sapiens 
(F) TISSUE TYPE: Tescls 

.{i:<) rZAT'JRE: 

(Ai NAME/KEY: sig^pepcide 
(3) LOCATION: -13..-! 

iZ] IDENTIFICATION METHOD: Vcn ileijne r'.^zrix 
(D: other INFORMATION: score 5.1 

seq LV3LC3WS??ir$/3? 
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Met Leu Lys Ser Val Leu Vai Ser Leu Cys Ser Trp Ser Pro Pro Leu 
-15 -10 -5 

Thr Ser Ser Pro Arg 
I 



(2) INFORMATION FOR SEQ ID NO: 288: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 4 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

iiy.) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -22." -1 

IC) IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 9 

seq FILAALSLSTTFS/LQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 



Met Thr Ser Lys Xaa lie Leu Vai 
-20 -15 

Leu Ser Thr Thr Phe Ser Leu Gin 
-5 1 

Val Ser Phe Asp Giy Phe Arg Tro 
15 



Ser Phe lie Leu Ala Ala Leu Ser 
-10 

Pro Tyr Gin Gin Lvs Val Leu Leu 

5 ' 10 

Asp Tyr Leu Tyr 
20 



(2) INFORMATION FOR SEQ ID NO; 28 9: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) t<CL£CULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Korr.o Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A; NAME/KEY: sia^peptide 
:=} LOCATION: -2C..-i 
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(C) ID51NTIFICATI0N METHOD: Von Hei^ne matrix 

(D) OTHER INT0RM2VTI0N: score 3.9 

seq LAVXLGLATAV3A/GP 

(xi) SEQUEMCE DESCRIPTION: SEQ ID NO: 239: 



Met Lys Ser Leu Ser Leu Xaa Leu Ala Val Xaa Leu Gly Leu Ala Thr 
-20 -15 .5 

Ala Val Ser Aia Gly Pro Ala Trp 
1 



(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -2lT.-l 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.8 

seq LLWALLFMQSLWP/QL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 



Met Trp Ala Met Giu Ser Gly His 
-20 -15 

Gin Ser" Leu Trp Pro Gin Leu Thr 
-5 1 

Leu Giy He Arg Asp Val Gin Trp 
15 

Val He Thr Asn Gin Pro Leu Asp 
30 35 

Leu Lys Ser Asp Lys Asn Arg lie 
45 50 



Leu Leu Trp Ala Leu Leu Phe Met 
-10 



Asp Gly Ala Thr Arg Val Tvr Tyr 
5 io 

Asn Tyr Ala Pro Lys Gly Arg Asn 
20 25 



Ser Asp lie Val Ala Ser Ser Phe 
40 

Giy Gly Thr Thr Arg Arg Pro Tro 
55 



(2) INFORMATION FOR SEQ ID MO: 291: 

(i) SEOUEMCF. CHARACTERISTICS: 

{A} LENGTH: 49 ammo acids 
(3) TYPE: AMINO ACID 
(Dj TOPOLOGY: LINEAR 
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(Li) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) N.AME/KEY: sig peptide 

(B) LOCATION: -2o7.-l 

(G) IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 3.8 

seq LLVMGSLPSASWS/LP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 



Met Ala Gin Thr Trp Ala Xaa Leu Leu Val Met Gly Ser Leu Pro Ser 
-20 -15 -10 -5 

Ala Ser Trp Ser Leu Pro Cys Leu Ser Tro Giu Ser Leu Leu Lys Ala 
1 5 * 10 

Ala Ala Cys Ser Glu Leu Asp Gly Arg Asn Val Gly Asn Thr Pro Thr 
15 20 25 

Arg 



(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: siq_peptide 
(3) LOCATION: -22.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.7 

seq LITLIYVWPVINA/CQ 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 



Met Lys Cys Gly Phe Leu Ala Tyr Leu Leu lie Thr Leu Leu Tyr Val 

-20 -15 -10 

Trp Pro Val lie .Asn Ala Cys Gin 

-5 1 
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(2) IvNFORMATION FOR SEQ ID NO: 293: 

(i) SEOCJENCE CHARACTERISTICS: 

(A) LENGTH: 36 amine acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(Al ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(i:<) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -25. .-1 

(C) IDENTIFICATION METHOD: Von Heijae matrix 

(D) OTHER INFORMATION: score 8.5 

seq LKVLLLPLAPAAA/QD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 



Met Arg Lys Pro Ala Ala Giy Phe Leu Pro Ser Leu Leu Lys Vai Leu 
-25 -20 -15 -10 

Leu Leu Pro Leu Ala Pro Ala Ala Ala Gin Aso Ser Thr Gin Ala Ser 
-5 1 5 

Thr Pro Gly Arg 
10 



(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

fii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/ KEY: sig peptide 

(B) LOCATION: -177. -1 

(C) IDENTIFICATION METHOD: Von rteij-e matrix 

(D) OTHER INFORMATION: score 6.4 

seq LLFLTSVVP-VLA/PR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2)4: 



Met Arg Gin Ser Leu Leu Phe Leu Thr Scr Vil Val Pro Phe Val Lsu 
-15 -10 -5 

AU Pro Arg Pro Pro Asp Asp Pro Gly Phe Gly Pro His Gin Arg Leu 



wo 99/0^9 PCT/IB98/0I231 

254 

1 - 5 10 15 

Giu Lys Leu Asp Ser Leu Leu Ser Asp Tyr Asp lie Leu Set Leu Ser 
20 25 30 

ksn lie Gin Gin Gin Xaa 
35 



(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 38 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE:. 

(A) NAhk/KEY: sig^peptide 
(3) LOCATION: -25.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.1 

seq SVLLGLLALMATA/AV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 



Met Giu Leu Ser Gin Met Ser Giu Leu Met Gly Leu Ser Val Leu Leu 
-25 -20 -15 -10 

Gly Leu Leu Ala Leu Met Aia Thr Ala Ala Val Ala Arg Gly Tro Leu 
-5 1 5 ' 

Arg Ala Gly Giu Val Arg 
10 



(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: flomo Sapiens 
(?) TISSUE TYPE: Testis 

(i;0 FE.^TURE: 

(A) NAME/KEY: sig peptide 
:3) LOCATION: -65?. -1 

[Z) IDENTIFICATION METHOD: Von Kci:nG matri:-: 
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Gly ieu Gly Phe 
-45 

Leu Val Ser Trp 
-30 

Ala Val Gin Ala 
-15 

Arg Val Ser Arg 
1 

Leu Val Val Tyr 
20 

Ser Val Trp Phe 
35 



(2) INFORMATION FOR SEQ ID NO: 298: 
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He lie Ser Ser 

His Ser Trp Val 
-25 

Leu Cys Gly Leu 
-10 

Val Ala Arg Leu 
5 

Leu Met Ala Thr 



Arg Thr Arg Ser 
-40 

Gly Ala Leu Thr 



Cys Leu Leu Cys 
-5 

Lys Leu Tyr His 
10 

Val Thr Val Leu 
25 



Giu Leu Pro His 
-35 

Leu Leu Ala Thr 
-20 

Pro Arg Ala Ala 



Leu Thr Cvs Gly 

is 

Leu Gly Met Tyr 
30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: 3ig__peptide 

(B) LOCATION: -57.. -1 

(C) IDENTIFICATION METHOD: Von Heljne matrix 

(D) OTHER INFORMATION: score 7.5 

seq LLHRLASFHRVW3/FP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 



Met Leu Arg Phe Pro Thr Cys Phe Pro Ser Xaa Arg Val Xaa Gly Xaa 
-55 -50 -45 

Lys Gin Leu Pro Gin Giu lie lie Xaa Leu Val Trp Ser Pro Xaa Arg 
-40 -35 -30 

Asp Xaa lie Xaa Leu Ala Asn Thr Ala Gly Giu Val Leu Leu His Arg 
-25 -20 -15 -10 

Leu Ala Ser Phe His Arg Val Trp Ser Phe Pro Pro Asn Giu Asn Thr 
-5 1 5 

Gly Xaa Giu Val Thr Cys Leu Ala Trp Arg Pro Asp Gly Lys Leu Leu 
10 15 " 20 " 



Ala Phe Ala Leu Ala Asp Thr Lys Lys lie Val Leu Cys Asp Val Giu 

25 " 3C 35 
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Lys Pro Giu Ser 
40 



(2) INFORMATION FOR SEC ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 aniino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
{B) LOCATION: -427.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.5 

seq LALVVALVAERFA/RR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 



Met Phe Met Val Leu Giu Val Val Val Ser Arg Val Thr Ser Ser Leu 
-40 -35 -30 

Ala Met Leu Ser Asp Ser Phe His Met Leu Ser Asp Val Leu Ala Leu 
-25 -20 -15 

Val Val Ala Leu Val Ala Giu Arg Phe Ala Arg Arg Thr His Aia Thr 
-10 -5 1 5 

Gin Lys Asn Thr Phe Gly Trp He Arg Ala Giu Val Met Gly Ala Leu 
10 15 20 

Val Asn Ala lie Phe Leu Thr Gly Leu Cys Phe Ala He Leu Leu Giu 
" 25 30 35 

Ala He Giu Arg Phe lie Giu Pro His Giu Met Gin Gin Pro Leu Val 
40 45 50 

Val Xaa Trp Gly Arg Ala Tro Xaa Ala Ala Giy Gin Arg Ala Giy Ala 
55 60 65 70 

Leu Pro Leu Pro Pro Ser Gin Arg Leu Gin Pro Giy Leu Arg Pro Arg 
75 80 35 

Pro Trp 



(2? INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 7 9 amine acids 
(BJ TYPE: AMINO ACID 
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(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -39.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.5 

seq LLLLLGLIVLVNI/GI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 



Met Giu Asn Gin Leu Tr? His Asn Thr Val Arg Cys Cys Asn Gin Tyr 

-35 -30 -25 

Gin Glu Ser Pro His Asp Ala Glu Asp lie Leu Leu Leu Leu Leu Gly 
-20 -13 -10 

Leu He Val Leu Val Asn lie Gly He Asn Val Ala Thr Met Met Trp 
-5 1 5 

His Gly Leu Gin Asn Ala Leu Asp Lys Met He Asp Trp Ala Thr Gin 

10 15 20 25 

Lys He Ala Val Phe Phe Ala Val Phe Vai Ala Ala Ala Ala Arg 

30 35 O 



(2) INFORMATION FOR SEQ ID NO; 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -18?. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.5 

seq ITLLTLSPNSVCC/CP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 



Met Leu Ser Xaa Lvs He Thr Leu Leu Thr Leu Ser Pro Asn Ser Val 
, -15 " -10 
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Cys Cys Cys Pro Ser Ala Thr Leu Giy Ala 5er Asn His 5er His Leu 
15 10 

Trp Arg Ser Thr Ser Arg His Giy lie Ser ?he Pro Tro Ala Phe Leu 
15 20 25 30 

Leu lie Asn Giy 



(2) INFORMATION FOR SEQ ID NO: 302: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY; LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -18 . .-1 

(C) IDENTIFICATION METHOD: Von Hsijne matrix 
(0) OTHER INFORMATION: score 7.4 

seq GWLVLCVLAI5LA/SM 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 302: 



Met Glu Giy Pro Arg Glv Trp Leu Val Leu Cvs Val Leu Ala He Ser 
-15 -10 -5 

Leu Ala Ser Met Val Thr Glu Asp Leu Cys Arg Ala Fro Asp Giy Lys 
15 10 

Lvs Giy Glu Ala Giy Xaa Pro Giy Arg Arg Giy Arg Pro Giy Leu Lys 
15 , 20 25 30 

aiy Glu Gin Arg 



(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
CD TISSUE TYPE: Testis 

(ix) rZATU?.E: 

;A) liAilE/KiY: sig_?ep::ide 
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- (3) LOCATION: -20.. -1 

(C) IDENTIFICATION METHOD: Von Heijne natrix 
ID) OTHER INFORMATION: score 7.3 

sec LAVFMLLAQLVSG/NW 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 



Mer Lys Ser Leu 
-20 



Leu Vai Ser Giy 



lie Cys Lys Lys 
15 



Trp Ala Met Cys 
30 



Arg Aia Asn Tyr 
45 



Ser Trxr Vai Thr 



Ala Ser Met Ser 
80 



Leu Phe Thr Leu 
-15 



Asn Trp Tyr Vai 
1 



Lys Cys Lys Pro 
20 



Gly Lys Gin Arg 
35 



Pro Vai Phe Cys 
50 



Ala Thr Thr Ala 
65 



Ser Met Ala Pro 



Ala Vai Phe Met 



Lys Lys Cys Leu 
5 



Glu Glu Met His 



Asp Cys Cys Vai 
40 



Vai Gin Thr Lys 
55 



Thr Thr Thr Leu 
70 



Thr Arg Phe Ser 
85 



Lsu L(»M Ala Gin 
-5 

Asn Asp Vai Gly 
10 

Vai Lys Asn Giy 
25 

Pro Aia Asp Arg 



Thr Thr Arg lie 
60 

Met Met Thr Thr 
75 



(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCL CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(9) LOCATION: -16.. -1 

(C) IDENTIFICATION METHOD: Von Keijne matrix 
(0) OTHER INFORMATION: score 7.3 

seq LILLFSLLISIVC/MI 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO; 304: 



Met Leu Lys Leu lie Leu Leu Phe Ser Leu Leu lie Ser lie Vai Cys 
-15 -10 -5 



Met lie 
1 
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(2) INrORMATION FOR S£Q ID NO: 3C5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

Ux) FEATURE: 

(A) MAME/KEY: sig peptide 

(B) LOCATION: -267.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.1 

seq LASLQWSLTLAWC /GS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 



Met Thr Pro Tr? Cys Leu Ala Cvs Leu Gly Arg Arg Pro Leu Aia Ser 
-25 -20 -15 

Leu Gin Trp Ser Leu Thr Leu Aia Tro Cys Gly Ser Gly Ser His Trp 
-10 -5 ' 1 5 

Thr Glu 



(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(li) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A! ORGANISM: Homo Sapiens 
{F5 TISSUE TYPE: Spleen 

lix) FEATURE: 

(A) NAME/KEY: 3ig_peptide 
(3) LOCATION: -26..-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.9 

seq LWVLLLCAHVVTL/LV 

i>:i; SEQl-EMCE DESCRIPTION: SEQ ID NO: 305: 



Mat Thr y?^.^ Arc; His Asn Trp Thr Pro Asc Leu Ser Pro Leu Tro Val 
-25 -20 -15 

-0- L-2U Cy,^ i-;? His Vai Vai Thr Lau Leu Val Arg Ala Thr Pro 
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-ID -5 15 

Val Ser Gin Thr Thr Thr Ala Ala Thr Ala Ser Val Arg Ser Thr Lys 
10 13 20 

Asp Pro Cys Pro Ser Gin Arg 
25 



(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE; 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -20.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6,9 

seq LFCATLSCMPATS/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 



Met Thr Gly Asn Asn Arg Asp Leu Phe Cys Ala Thr Leu Ser Cys Met 
-20 -15 -10 -S 

Pro Ala Thr Ser Aia Pro His Met Lys Leu Pro Asp lie Ser Phe His 
15 10 

Leu Pro Gly 
15 



(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{?) TISSUE TYPE: Ovary 

(iK) FEATURE: 

•A) NAJ4E/KEY: sig_peptide 
(3i .LOCATION: -26. .-1 

(C) IDENTIFICATION METHOD: Von Hei-ir.-5 matrir^ 
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(D) OTHER IN-FORMATION: score 6.9 

seq LWVLLLCAHVVTl/LV 

(xi) SEQUENCE DESCRI PTIOSI ; SEQ ID MO: 308: 



Met Thr Met Arg His Asn Trp Thr Pro Asp Leu Ser Pro Leu Trp Vai 
-25 -20 -15 

Leu Leu Leu Cys Ala His Val Val Thr Leu Leu Val Arg Ala Thr P-o 
-10 -5 1 5 

Val Ser Gin Pro Thr 
10 



(2) INFORKATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(3) TYPE: AMINO ACID 
(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(r ) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_pepcide 

(B) LOCATION: -20.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.9 

seq LYLLGMLVPGGLG/YD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 



Met Lys Pro Leu Leu Glu Thr Leu Tvr Leu Leu Glv Met Leu Val Pro 
-20 ' -15 -10 * -5 

Gly Gly Leu Gly Tyr Asp Arg Ser Leu Ala Gin His Arg Gin Glu He 
15 10 

Val Asp Lys Ser Vai Ser Pro Trp Ser Leu Glu Thr Tyr Ser Tyr Asn 
15 20 25 

He Tyr His Pro Met Gly Glu He Tyr Glu Trp Me- Arg Glu lie Ser 
30 35 4C 

Giu Lys Tyr Lys Glu Val Vai Thr Gin His Phe Leu Gly Vai Thr Tyr 
4 5 50 55 60 

Giu Thr Gin Pre Ala 
65 



(2) INFORt-i:i.T:o:: FOR SEQ ID NO: 310: 
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(i) SEQUENCE CHARACTERISTICS: 

iA) LENGTH: 72 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hotr.o Sapiens 
ir) TISSUE TYPE: Spleen 

;ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: -65^.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.8 

seq LLFLISLAAHLSQ/WT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 



Met Asn Gin Ala Asp Pro Arg Leu Arg Ala Val Cys Leu Trp Thr Leu 
-65 -60 -55 -50 

Thr Ser Ala Aia Met Ser Arg Gly Asp Asn Cys Thr Asp Leu Leu Ala 
-45 * -40 -35 



Leu Gly He Pro Ser He Thr Gin 
-30 

Gly Ala Val Thr Leu Leu Phe Leu 
-15 -10 

Gin Trp Thr Arg Gly Arg Ser Gly 
1 5 



Ala Trp Gly Leu Trp Val Leu Leu 
-25 -20 

He Ser Leu Ala Ala His Leu Ser 



(2) INFORZ-L^TION k'OR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 amino acids 
{3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) t^*OLECULE TYPE: PROTEIN 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
.;?) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: siq^peptide 
:3) LOCATION: -4 2. .-1 

[Z) IDENTIFICATION METHOD: Vcn Heijne matrix 
:D) OTHER INFORMATION:, score 6.3 

seq LICILSSITMVIC/RH 

C^i) ZZOUZi:ZZ DESCRIPTION: SEQ ID NO: 311: 
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Met His Arg Gin lie Ser Phe Lea Leu Leu Arg Lys Pre Arg Lys Asn 

-35 -30 

Trp Phe Cys Gin Asn His Val Asn Leu Arg Lvs Arg Tyr Leu Leu Ser 
-25 -20 ' -15 

lie Leu Ser Ser Leu Thr Met Vai He Cys Arg His Giy 
-10 -5 1 



(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
is) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptids 
(H) LOCATION: -43. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.8 

seq ALSAXTFVSFLHA/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 



Met Lys Gin Tr? Leu Cys Trp Val Leu Arg Leu Glu Gly Arg Gin Giy 
-40 -35 -30 

Leu Gly Vai Gly Glu Pro Arg Glv Leu Arg Leu Cys Ltju Gly Ala Leu 
-25 -20 -IS 

Ser Aia 'Xaa Thr Phe Val Ser Phe Leu His Ala Ala Pro His Ser His 
-10 -5 1 5 

Pro Ala Leu Gly 



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

{ii: MOLECULE TYPE: PROTEIN 

(vi.i ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 
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(ix). feature:: 

(A) NAME/BCEY: sig^oeptide 

(B) LOCATION: -66/.-1 

(C) IDENTIFICATION METHOD: Von Heijne .tiacrix 
(D| OTHER INFORMATION: score 6.8 

seq LLFFLFPIiriRS/QH 

(KiJ SEQUENCE DESCRIPTIOM: SEQ ID MC: 313:. 



Met Arg Leu Giy Leu Cys Phe Trp Val Pro His Arg Glv Giu Met Ser 
-65 -60 -55 

Phe Ser Ser His Tyr Ser Arg Giy Thr Trp Tvr Gin Trp Asp Leu Ser 
-50 -45 -ic -35 

Leu Leu Met Leu Thr Leu lie Ser Trp Phe Arg Trp Cys Leu Pre Aia 
-30 ■ -25 -20 

Val Ser Thr Vai Giu Leu Leu Phe Phe Leu Phe Pro He Leu Phe lie 
-15 -10 -5 

Arg Ser Gin His Arg 
1 



(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -101.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.6 

acq IIIVITIT3AC5A/CI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 



.Met .Asp Phe Trp Giu Giu Tyr Arg Arg Giy Asp Vai Pro Phe Ser Trp 

-100 -95 * -9c 

Cys Pro lie Arg Ser Tyr Leu Met Ser Val Cys Pre Vai Thr Giy Lvs 

-35 -80 -75 -70 



Vai Asn Leu Asn His Leu Val Lys Vai Aia Ser Aia Arg Phe Leu His 
-65 -50 -S5 



Gin Val Thr lie Phe Pro Phe Leu Tyr Ser Val Lys kia As.-i Tyr Cys 
-30 -45 -4") 
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(A: LSNGTH: 92 amino acids 
(3) TYPE: AMINO ACID 
iO) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(r) TISSCJE TYPE: Spleen 

(ix) FEATURE: 

•:A) name/ KEY: sig^peptide 
(3) LOCATION: -26.. -1 

(C) IDENTIFICATION METHOD: Von Hei^ne matrix 

(D) OTHER INFORMATION: score 6.5 

seq LALGSAGLLWCIA/GF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 



Met Vai Phe Ala Thr lie Giy Phe Ser Leu Lys Ser Gly Leu Ala Leu 
-25 -20 -15 

Gly Ser Aia Gly Leu Leu Trp Cys Leu Ala Gly Phe Phe Gly Tyr Asp 
-10 -5 1 5 

Thr Gin Gin Pro Thr Ala Pro Asn Ala lie Giu Gly Tyr Arg Val Met 
10 15 20 

Ser Ser Phe Giy Val Gly Aia Leu Phe Ala Aia Cys Thr He Cys Leu 
25 30 35 

Leu Ala Xaa Lys Leu Asn Lys Gin Thr Thr Leu Lys Met Ala Asp Asp 
40 45 50 

Leu Ala Gin Arg Arg Gin Gin Ala Asp Leu Aia Pro 
55 60 65 



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

<Yi) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapiens 
JF) TISSUE TYPE: Testis 

(i:c| FEATURE: 

(A) NAME/?:EY: sig^peptide 
;B) LOCATION: -uT.-l 

(C) IDENTiriCATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.4 

seq VLLLSGSVSVGVC/CA 

r^'Q'.:Z::CZ D£3C?.:?TI0M: seq id NO: 317: 
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Met Val Leu Leu Leu Ser Giy Sf^r Val Ser Val Giy Val Cys Cy.s Ala 
-10 -5 1 

Tyr Leu Cys lie Ser lie Ser Lys Thr Pro Thr Ala Cys Ala Leu Tyr 
5 10 15 

Giy Leu Tyr Leu Pro Phe ?he 
20 25 



(2) IMFORMATIO^^ FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -297. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.4 

seq GLCXLCXVXNVFA/GS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 



Met Cys Ser Gin Lys Arg Ala val 
-25 

Giy Leu Cys Xaa Leu Cys Xaa val 
-10 

Pro Giy Lys Ser His Cys His Ser 

5 10 



Ser Asn Gin Giy Leu Met Asp Leu 
-20 -15 

Xaa Asn val Phe Ala Giy Ser Met 
-5 1 

Pro Phe ser He Asn Gin Giy Arg 
15 



(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LZMEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapienc 
vF) TISSUE TYPE: Ovary 

(ix! FEATURE: 
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- (A) NAME/KEY: sig^peptide 

(B) LOCATION: -14..-! 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.4 

seq LIVLTLK3PSCDT/AQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31^: 



Met Leu lie Vai Leu Thr Leu His Ser Pro Ser Cys Asp Thr Ala Gin 
-10 -5 1 

Glu Giu Met Gly Arg Val Pro Thr Thr Pro Lys Cys Arg Trp Lys Leu 
5 10 15 

Gly Leu Ser Met Cys Ser Leu Leu Thr Pro Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 anino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -62.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 6.4 

seq SVLWLGALGLTIQ/AV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 



Met Thr Arg Leu Cys Leu Pro Arg Pro Glu Ala Arg Glu Asp Pro lie 

-60 -55 -50 

Pro Val Pro Pro Arg Gly Leu Gly Aia Gly Giu Giy Ser Gly Ser Pro 

-45 -40 -35 

Val Arg Pro Pro Val Ser Thr Trp Gly Pro Ser Trp Ala Gin Lau Leu 

-30 -25 -20 -15 

Asp Ser Val Leu Trp Leu Gly Ala Leu Giy Lsu Thr lie GI.- Ala Val 

-10 -5 1 

Phe Ser Thr Thr Gly Pro Aia Leu Leu Lej Leu Leu Val Ser Phe Leu 

5 10 15 

Thr Phe Aso Leu Leu His Arg Pro Aia Vai Thr Leu Cys His Ser Aia 

20 * 25 30 
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Asn Phe Ser Pro Giy Ala Arg Val Arg Gly Pro Val Lys Val Leu Asp 
35 40 45 SO 

Ser Arg Arg Leu Tyr Ser Cys Lys Trp Vai Gin Ser 
53 60 



(2) INFORMATION FOR SEQ ID NO: 321: 

ii) SSQUiNCo CHAR.^CT£RISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig__peptidfi 

(B) LOCATION: -IST.-I 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: score 6.4 

seq LTCLFLSLISTYP/SC 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 



Met Val Leu Thr Cys Leu Phe Leu Ser Leu lie Ser Thr Tyr Pro Ser 
-15 -10 -5 1 

Cvs lie Tar Leu Phe Leu Ser Lvs lie Pro Asn Pro Leu Ser Ser Leu 
5 ' 10 15 

Pro Ser Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acid:> 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ho.Tic Sapians 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/ KEY: sig^peptide 
(3) LOCATION: -18,. -1 

(C) IDENTIFICATION METHOD: Von Heijne aa-riy. 

(D) OTHER INFORMATION: score c.3 

seq FSF3LCLLS3SST/J!? 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 



Met Leu lie Pro Val Phe Ser Phe Ser Leu Gia Leu Leu Ser Ser Ser 
-15 -10 -5 

Ser Thr Asn Pro Val Asn Ser Thr Phe Gin Met Pro Phe Glu Ser Ser 
1-5 10 

His Kaa Thr Thr Arg 
15 



(2) INFORMATION FOR SEQ ID NO: 323: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -477. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.3 

seq LLLLESVSGLLQP/RT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 



Met Ala Ala Ala Xaa Leu Ser Gly Pro Ser Ala Gly Ser Ala Ala Gly 
-45 -40 -35 

Val Pro" Gly Gly Thr Gly Gly Leu Ser Ala Val Ser Ser 3iy Pro Arg 
-30 -25 -20 

Leu Arg Leu Leu Leu Leu Glu Ser Val Ser Gly Leu Leu Gin Pro Arg 
-15 -10 -5 1 

Thr Gly Ser Ala Val Ala Pro Val His Pro Pro Asn Arg Ser Ala Arg 

5 • 10 15 



(2) IMFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

t^OLECULE TYPE: PROTEIN 
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(vi)- ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSCE TYPE: Ovary 

{ix> FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: -15?. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.2 

seq NWLFLFVFTFCNC/FF 

(xi) SEQUENCE DESCRIPTION: SZO 10 NO: 324: 



Met His Asn Trp Leu Phe Leu Phe Val Phe Thr Phe Cys Asn Cys Phe 
-15 -10 -5 1 

Phe Lys Asn Asn Gly 
5 



(21 INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(i:<) FEATURE: 

{Pi) NAME/KEY: sig peptide 

(B) LOCATION: -177.-1 

iC) IDENTIFICATION METHOD: Von Heijne natri:< 
(D) OTHER INFORMATION: score b.2 

seq CFYFLSTALG5QA/DS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 



Met Kis Val Glu Cys Phe Tyr Phe Leu Ser Thr Ala Leu Gly Ser Gin 
•15 -10 -5 

Ala Asp Ser Trp Val Ser Giy Leu Gin Gin Aia Gly l«u Leu Pry Ala 
1*5 10 15 

lie Gly Tyr Arg 



{2i IMFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 
(3) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 



wo 99/06549 



274 



PCT/IB98/01231 



(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANIStM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

{ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -19.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.1 

seq LALLWSLPASDLG/RS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 



Met Ser Pro Gly Ser Ala Leu Ala Leu Leu Tro Ser Leu Pro Ala Ser 
-15 -10 * -5 

Asp Leu Gly Arg Ser Val lie Ala Gly Leu Trp Pro His Thr Gly Vai 
1 5 10 

Leu lie His Leu Glu Thr Ser Gin Ser Phe Leu Gin Gly Gin Leu Thr 
15 20 25 

Lys Ser lie Phe Pro Leu Cys Cys Thr Ser Leu 
30 35 40 



(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(yi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

Cix) FEATURE: 

{A) NAME/KEY: sig_peptide 

(B) LOCATION: -13..-! 

(C) IDENTIFICATION METHOD: Von Heijne macri:< 

(D) OTHER INFORMATION: score 6.1 

seq MALALGSIPSSIA/SS 

(xi) SEQUENCE DESCRIPTIOM: SEQ ID NO: 327: 



Met .Ma Leu Ala Leu Gly Ser lis 
-10 

Vii His Vol Ser His Phe Cys Pro 
5 10 

Gin 5ar Thr Pro Lys 



Pro Sec Ser He .-.la Ser Ser Trp 
-5 1 

Cys Leu Leu His Thr Thr Leu Pro 
15 
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(2) INFORMATION FOR SZQ ID NO: 328: 

;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: -15.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.1 

seq FLFCTLFSLVVHP/SH 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 



Met Leu Ala Phe Leu Phe Cys Thr Leu Pae Ser Leu Val Val .His Pro 

-15 -10 -5 

Ser His lie Asp Leu Lys Cys Ser Phe Tyr 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: siq_peptide 
(3) LOCATION: -36.. -1 

(C) IDENTIFICATION METHOD: Vcn Heij/.e matri;'. 
iO) OTHER INFOPa-IATIOM: score 6 

seq LLYTLQ-::33L3G/CF 

(xi) SEQUENCE DESCRIPTIDj: : SEQ ID KO: 32.^: 



I'liz Ala Gin Met Pro Leu Thr Glv Ser Tvr Gin A.-:; leu Glu Tvr Phe 
-35 -30 * ' -21 
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Leu Glu Cys Met Phe Leu His Leu Leu Tyr Thr Leu Gin Thr lie Ser 
-20 -15 -LO -5 

Ser Leu Ser Giy Cys ?he Lys Gin Phe ?he Phe Gin Leu Asn Cys ?he 
1 5 10 

Cys Trp Gly Glu lie Leu Trp His Ser Ser Phe Leu His Ser Giy Ser 
15 20 25 

Cys Leu Leu Val Leu Leu lie Lys Lys Lys Lys lie Tyr Leu Gin Ser 
30 35 40 

Xaa Xaa lie Tyr Thr Gly Tyr Xaa lie Asp Xaa Xaa Xaa Leu Xaa Xaa 
45 50 55 60 

Phe Ser lie Pro Leu Ser Phe lie Gin Phe 
65 70 



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHAPACTERI5TICS : 

(A) LENGTH: 20 ami.no acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vij ORIGINAL SOURCE: 

(A) ORGANISM: Hoxo Sapiens 
(?) TISSUE TYPE: Ovary 

{ix] FEATURE: 

(A) NAME/ KEY: sig peptide 
(31 LOCATION: -15T*-L 

(C) IDENTIFICATIOM METHOD: Von Heijne matrix 

(D) OTHER INF0RI4ATI0N: score 6 

seq LLMGLVr/RTVLQG/KE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 30: 



Met Aia Leu Leu Met Gly Leu Tro Vai Arg Thr v.3i Leu Gin Gly Lys 
-15 -10 ' -5 1 

Glu Ala Ser Gly 



C2: INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

«:.-.) LENGTH: 27 ar.ino acids 
ih) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: PP.CTEIN 

{vi; 0?.:GINAL SOURCE: 
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(A) ORGANISM: Mono Sapiens 
(D TISSUE TYPE: Oterus 

(ix> FEATURE: 

(A) ^mME/KEY: sig peptide 
(3) LOCATION: -19".-i 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6 

seq LAILIXSLKLTIG/IQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 



Met lie Asn His Leu Tyr Leu Ala He Leu He Xaa Ser Leu Lys Leu 

-15 -10 -5 

Thr He Gly He Gin Lys Arg Phe Gly Pro Pro 
1 5 



(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/ECEY: sig_Deptide 

(B) LOCATION: -50.. -1 

(C) IDENTIFICATION METHOD: Von Kei^ne macrix 

(D) OTHER INFORMATION: score o 

seq LLYLC5r?LPGT3/LF 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 3.32: 



Met Gly Arg Gin Gly Thr Leu Glu lie Giu Glv He Leu Cys Val He 
-50 -45 -^0 -35 

Thr Trp Leu Giu Ala Asn Leu Gly Lys Gin Lys As? Giu Asn His Tyr 
-30 -25 -20 

Tyr Lys Lys Leu Ser Leu Leu Tyr Leu Cys Ser r^ie Pro Leu Pro Gly 
-15 ' -10 -5 

Thr Ser Leu Phe Leu Leu Cys Ser Phe Ser Tyr Leu Thr Gin Arg Leu 

I 5 10 



Ser Gin Gly Gly Gly 
15 
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(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -39.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.9 

seq SAWWCVLL£WSQG/AS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 



Met Glu Leu Thr Asn Lys Gin Thr Giy Thr Asp Arq His Giu Gin Vai 
-35 -30 -25 

Leu Arg Arg Val Lys Gin Asp Lys Arg lie Ser Ala Trp Trp Cys Vai 
-20 -15 -10 

Leu Leu Giu Trp Ser Gin Giy Ala Ser Leu Arg Arg Gin His Arg Giy 
-5 1 5 

Glu Thr Ser Pro Lys Ser Giy Glu Arg Leu Ser Arg Gin Arg Giu Gin 
10 15 20 25 

Gin Lys Pro Gin Met Ser Asp Lys Ser Leu 
30 35 



(2) INFORMATION FOR SEQ ID NO: 334: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Saoiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -227.-1 

(C) IDENTIFICATION METHOD: Von ilei^r.e matrix 

(D) OTHER INFOPsMATION: scora 5.9 

seq VlGLLcSrSDTWA/PA 



(xi; SEQUENCE DESCRIPTION: SEQ ID NO: rj; 
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Met Ala Lys Arg Gin A3n Pro Thr Ser Val Leu Gly Leu Leu Phe Ser 
-20 -15 -10 

lie Ser Asp Thr Trp Ala Pro Ala Vai Ser Ser Trp lys Ala 31u Ala 
-5 13 10 

Lys Asp Gly Ala Asp Gin Glu As? Ala Arg Xaa Xaa Ser Gin Arg Ser 
15 20 25 

Pro Xaa Ser Thr Ala Gly Ser Gin Glu Pro Tyr Phe Tro Phe Val Trp 
30 35 40 

Val Glu Gly Glu Gly Arg 
45 



(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(iil MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/ECEY: sig_peptide 
(3) LOCATION: -25.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORRATION: score 5,9 

seq rCLSLQIFRVSLA/LA 

{:<i) SEQUENCE DESCRIPTICM: SEQ ID NO: 335: 



Met Asn Val Leu Pro Phe Ser Tyr Tyr Tyr lio Leu Phe Cys Leu Ser 

-25 -20 -15 -10 

Leu Gin He Phe Arg Val Ser Leu Ala Leu Ala Xaa Thr His Glu Val 
-5 1 5 

Pro Val Ser Thr His Thr Asn Xaa Leu His 
10 15 



(2) INrOFjyiATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHAR?.CTERISTICS : 

(A) LENGTH: 53 amino acids 
(3) TYPE: AI'IIMC ACID 
(0) TOPOLOGY: LINEAR 



:ii) r^.OLECULE TYPE: PROTEIN 
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(vi) ORIGINAL SOURCE: 

ik) ORGANISM: Hcmo Sapiens 
(?) TISSUE TYPE: Ovary 

(A) NAME/KEY: sig peptide 
■3) LOCATION: -297.-1 

iC) IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 5.9 

seq FSYISXFLSPVCG/CS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 



Met Lys Cys Leu Lys Vai Asn Pro Phe Leu Phe Leu Val Phe Asn Phe 
-25 -20 -15 

Phe Ser Tyr lie Ser Xaa Phe Leu Ser Pro Val Cys Gly Cys Ser Vai 
-10 -5 1 

Cys Asn Leu Lys His Trp Glu Asn Glu Leu Leu Phe Pro Ser Pro His 
5 10 15 

Phe Leu Pro Tyr Lys Phe Xaa Phe Leu Phe 
20 25 



(2) IN?0R>L2lTI0N FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hono Sapiens 
(?) TISSUE TYPE: Testis 

(ix) cEATURE: 

(A) NAME/KEY: sig^pepcide 
(3) LOCATION: -33.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
CD) OTHER INFORMATION: scoz^. 5.8 

seq XLCLGMALCPRQA/TR 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO; 337: 



y.ei S-r Trc T.-.r Val Pro Val Val Arg Aia Ser Gin Arg Val Ser Ser 

-30 -25 -20 

Vai Giy Aia .-.sn Xaa Leu Cvs Leu Gly Met Aia Leu Cys Pro Arg Gin 

-15 -10 -5 

Aia T>.r Arg lie Pro Leu .Asn Gly Thr Trp Leu Phe Thr Pro Val S^-r 

1 5 * iO 15 
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(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(id) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/ECEY: sig peptide 
{B) LOCATION: -IST.-I 

(C) IDENTIFICATION METHOD: Von Heijne natrix 

(D) OTHER INFORMATION: score 5.8 

seq FLXLMTLTTKVHS/SA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 



Met Gly Phe Leu Xaa Leu Met Thr Leu Thr Thr His Vai His Ser Ser 
-15 -10 -5 1 

Ala Lys Pro Asn Gly 
5 



(2) IMFORMi.TION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 ar>ino acids 
(3) TYPE: AMINO ACID 
(D| TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: -IsT.-I 

(C) IDENTIFICATION METHOD: Von Heijne natxix 
(C) OTHER INFORMATION: score 5.7 

sec RVLLLAQLFLGSG/KT 

(y.ii SEQUENCE DESCRIPTION: SEQ ID HO: 339: 



Met Leu Phe Arg Vai Leu Leu Leu Ala Gin Leu Phe Leu Gly o-ir Gly 
-15 -10 -5 
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ays Tiir Leu Arg Thr Pro 
i 5 



(2) INFOW-IATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptidle 

(B) LOCATION: -32.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matriy. 

(D) OTHER INFORMATION: score 5.7 

seq SL?LSTSA?PLRG/LR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 



Met Arg Val Pro Glu Asp Leu Ala 
-30 -25 

Ala Pro Gly Ser Leu Pro Leu Ser 
-15 -10 

Leu P.rg Leu Lys Glu His Pro Gly 
1 5 

Ala Cys Pro Glu Thr Pro Ala 
20 



Ser Lys He Leu Leu Pro Gly Cys 
-20 

Thr Ser Ala Pro Pro Leu Arg Gly 
-5 

Arg Gly Pro Ser Ser Pro Lys Ala 
10 15 



(2) IMFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(Vi) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapien:^ 
(F) TISSUE TYPE: Ovary 



(ix) FEATURE: 

(A) NAME/KEY: sig_peptici*» 

(B) LOCATION: -31.. -1 

(C) IDENTIFICATION METHOD: '.or Hriine matrix 
tO) OTHER INFORM.A.TION: scor^- 3.7 

s*..: 30:-CLCQCrLARA/I;D 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 



Met Phe Pro His Xaa Giu Thr Gin Val Lys Cys Phe Trp Glr. Giy Leu 
-30 -25 -20 

Arg Arg Ser Asp Leu Cys Leu Cys Gin Cys lie Leu Ala Arg Ala His 
-15 -10 -5 1 

Asp Giy Asp Leu Tyr Leu Phe Phe 
5 



(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -20.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORZ^TION: score 5.6 

seq LAVFMXLAQLVSG/NW 

(.xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 



Met Lys Ser Leu Leu Phe Thr Leu Ala val Phe Met Xaa Leu Ala Gin 
-20 -15 -10 -5 

Leu Val Ser Giy Asn Trp Tyr Val Lys Lvs Cys Leu Asn Xaa Phe Giy 
15 10 

lie Cys Lys Xaa Lys Cys Lys Pro Giu Glu Met His Vai Lys Asn Giy 
15 20 25 

?ro Xaa Met Cvs Giy Lys Gin Arg Asp Cys Cys Val Pro Ala Asn Giy 
30 ' 35 40 



(2) INFORMATION FOR SEQ ID NO: 343: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: PROTEIt: 
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<vir^ ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sigjpeptide 

(B) LOCATION: -23..-! 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
CD) OTHER INFORMATION: score 5.6 

seq LLNVACCIPFSSS/LF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 3: 



Met His Leu Tyr Ser Cys Ser Cys Met Arg Leu Leu Asn Val Aia Cys 
-20 -15 -10 

Cys lie Pro Phe Ser Ser Ser Leu Phe Pro His lie Leu Phe Lys Ser 
-5 1 5 

Leu Asn Tvr Ser Leu Thr Ser Phe Leu Lys Ala Val Arg Gly Arg Trp 
10 ' 15 20 25 



(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sigjpeptide 

(B) LOCATION: -16..-! 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.5 

seq PLVLSPLSYQCSS/QG 

(xi) SEQUENCE DESCRIPTION: SZQ ID NO: 344: 



Met Arg Ala Pro Leu Val Leu Ser Pro Leu Ser Tyr Gin Cys Ser Ser 
-15 -10 -5 

Gin Gly His lie Trp 
1 5 



:2) INFORMATION FOR SEQ ID NO: 345: 



(i) SEQUENCE CHARACTERISTICS: 

:A) LEJIGTH: 39 amino acids 
{d) r/PE: AMINO ACID 
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(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGIWAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -36..-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 5.5 

seq FTSMCILFHCLLS/FQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 



Met Gin Vai Pro His Leu Arg Val Trp Thr Gin Val Xaa Asp Thr Phe 
-35 -30 -25 

lie Gly Tyr Arg Asn Leu Gly Phe Thr Ser Met Cys lie Leu Phe His 
-20 -15 -10 -5 

Cys Leu Leu Ser Phe Gin Arg 
1 



(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -367. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.4 

seq LWLMHQSFQKSNS/SS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 



Met Gin Lys Leu Met Ala Val Pro Met lie Thr Arg Ala Gin Gly Gly 
-35 -30 -25 

Asp Thr Cys Thr Arg Gin He Leu Trp Leu Met His Gin Ser Phe Gin 
-20 -15 -10 -5 

Lys Ser Asn Ser Ser Ser Thr Ser Tyr Cys Ser Ala Gin Gly 
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(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENC^: CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -45. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.4 

seq AHRSLCLWPACLC/AR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 



Met Cys Xaa Ala Giy Phe Xaa Asp His Pro Arg Ala Ala Arg His Ala 
-45 -40 -35 -30 

Arg Thr Ser Arg His Pro Leu Pro Trp Val Cys Val Ser Gin Xaa Pro 
-25 -20 -15 

Ala His Arg Ser Leu Cys Leu Trp Pro Ala Cys Leu Cys Ala Arg Val 
-10 -5 1 

Leu Pro Pro Ala Pro Gly 
5 



(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{?) TISSUE TYPE: Testis 

(ix) FIATURE: 

(A) NAME/KEY: sic_peptide 
(3) LOCATION: -187. -1 

(C: IDENTIFICATION M.ETHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 5.4 

b-q ILVSFILAALSLS/TT 

(xi) SEQUENCE DESCRIPTIC:: : :j£0 ID NO; 343: 
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Met Thr Ser Lys ?he lie Lea Vai 3er Phe lie Leu Ala Ala Leu Ser 
-15 -10 

Leu Ser Thr Thr lie Giy 
1 



(2) INFORMATION FOR SEQ ID NO: 34 9: 

{i) SEOUEMCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) MAME/KEY: sig^peptide 

(B) LOCATION: -15. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.4 

seq LLIrlLTVHHTPS/LP 

(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 349: 



Met His Leu Leu lie Phe lie Leu Thr Vai His His Thr Pro Ser Leu 
-15 -10 -5 1 

Pro Ser 



{2) INFORMATION FOR SEQ ID NO: 3S0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FE-ATURE: 

(A) NAME/KEY: sig^r.eptide 
(3) LOCATION: -lb. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.3 

seq SSLM^/QLISQVYS/CM 

(xi) SEQUENCE OESCRIPTIOii : ZZO 13 t-'O: 350: 
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Met Leu Ser Ser Ser Leu Met Val Gin Leu lie Sec Gin Val Tyr Ser 
-15 -10 -5 

Cys Met Arg Arg 
1 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
(31 TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN' 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(Lk) FEATURE: 

(A) NAME/KEY: sig^peptide ^ 

(B) LOCATION: -14,. -1 

(C) IDENTIFICATION METHOD: Von Heijae matrix 

(D) OTHER INFORMATION: score 5.3 

seq FSYILCMLFCLFS/QD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 



Met Phe Ser Tyr lie Leu Cys Met Leu ?he Cys Leu Phe Ser Gin Asp 
-10 -5 1 

Lys Phe Leu Gla Val Thr Leu Leu Cys Glu Arg Tyr Met Leu 
5 10 15 



(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 4 5 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Saoiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptice 
(3) LOCATION: -19. . -1 

(C) IDENTIFICATICM METHOD: Von Heijne matrix 

(D) OTHER INFOR:wriON: score 5.2 

sec VTLAFSLLVLSES/AV 

(xi) SEQUENCE DESCRI?TIO:i : SEQ ID MO: 352: 
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Ket Leu Phe Leu Tyr Tyr Vai Thr Leu Ala Phe Ser Leu Lsu Val Leu 
-15 -10 -5 

m 

Ser Glu Ser Ala Val Leu Lys Arg Arg Glu lie Phe Xaa Thr Gly Leu 
1 5 10 

Gly Cys Val Thr Gly Leu Gly Cys Val Thr Gly Leu Arg 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hcmo Sapiens 
(Fl TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sigjpeptide 

(B) LOCATION: -14.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.2 

seq LLSGLWLSSVKEC/DD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 



Met Leu Leu Ser Gly Leu Trp Leu Ser Ser Val Lys Glu Cys Asp Asp 
-10 -5 1 

Trp Arg Ala Asp Gly Cys Leu Pro Ser lie Vai His Pro Leu Arg 
- 5 10 15 



(2) INFORM.ATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sicj^pepcide 
(3) LOCATION: -13.. -1 

(C) IDENTIFICATION XETHOD: Von Heiine matrix 
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- (D) OTHER INFORMATION: score S.2 

seq VFCrSWLMS3SS?/SI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 



Mez Val Ala Phe Ser Val Phe Cys Phe Ser Trp Leu Met Ser Ser Ser 
-15 -10 ' -5 

Ser Pro Ser He Phe Trp Ser His Piie Tyr Ser Pro Phe Lys Asp Leu 

1 5 10 

Ser Lys Met Tyr Asn Tyr Val Ser Pro 

15 20 



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -16?. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
{D) OTHER INFORMATION: score 5.1 

seq LALGIGPPGCLQG/SP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 



Met Val Pro Leu Ala Leu Gly He Giy Pro Pro Giy Cys Leu Gin Giy 
-15 -10 -5 

Ser Pro Ser Gin Trp Leu Val Arg Ala Fro Gly Ala Gin Leu Ser Pro 
15 10 15 

He Gly Val Ala Thr Glu Arg Glu Gin Arg 
20 25 



(2) XNEORr-LATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

■A) LENGTH: 41 amino acids 
(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

Hi) MOLECULE TYPE: PROTEIN 



(vii ORIGINAL SOURCE: 
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- (A) ORGANISM: Hcmo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -32?. -1 

;C) IDENTIFICATION METHOD: Voa Heijne matrix 
iO) OTHER INFORMATIOM: score 5.1 

seq LLWFCTAMRPGGA/GL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 



Met Asn Leu Cys Met Gly Val Leu Leu Lys Vai Gly Thr Ser Arg Arg 
-30 -25 -20 

Cys Leu Cys Leu Leu Trp Phe Cys Thr Ala Met Arg Pro Gly Gly Ala 
-15 -10 -5 

Gly Leu Pro Asn Ala Thr Pro Glu Trp 
1 5 



(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hcmo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

CA) NAME/KEY: sig^peptide 
(3) LOCATION: -14. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.1 

seq SLAKSLFLRVARG/LG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 



Met Ser Leu Ala Lys Ser Leu Phe Leu Arg Vai Ala Arg Gly Leu Gly 
-10 -5 1 



(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 
;B) TYPE: Af«NO ACID 
(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE): PROTEIN 
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(vi)- ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
:B) LOCATION: -18. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
■0) OTHER INFORMATION: score 5.1 

seq FLPSATLLLSAES/FF 

{xi: SEQUENCE DESCRIPTION: SEQ ID NO: 358: 



Met Arg Leu Pro Pro Phe Leu Pro Ser Ala Thr Leu Leu Leu Ser Aia 
-15 -10 -5 

Glu Ser Phe Phe Arg Ser Vai Ser Glu Tyr Pro Ser Leu Pro Ser Pro 
1 5 10 

Ser Aia Giy Giy Pro Gly Cys Vai Ser Giy Lys Trp Gly Ser Gly Ser 
15 20 25 30 

Asn Giy Pro Leu Ser Ser Leu Ser Cys Ser Leu Cys Arg Lys Pro Leu 
35 40 45 

Leu His Ser Thr Ala Leu Ser Ser Ser Arg Pro Phe Phe Ser Pro Glv 
50 55 60 

Phe Pro Cys Gin lie Ser Pro Arg Ser Gly Leu His Pro Leu 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPt: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -4 9..-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5 

seq PLLLLLREELVTG/AV 



(xi) SEQUENCE DESCRIPTION: 

Met Ger Asp Arg Lys Arg Thr Lys 
-45 

Pro lie Ser Leu Leu Pre Ai-j Ssr 



SEQ ID NO: 359: 

Phe Ser Tyr Val Gin Leu Pro Cys 
-40 -35 

Phe Lys Arg Giy Gin lie Pro Cly 
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■■ -30 -25 -20 

Pro Ser Ala Pro Pro Leu [,eu Leu Leu Leu Arg Glu Giu Leu Val Th" 
-15 -10 _5 

9 

Giy Aid Vai 
1 



(2) INF0RC4ATI0N FOR SEQ ID NO: 360: 

(i) SSQUINCS CHARACTERISTICS: 

(A) LENGTH: 7 9 amiao acids 
(3) TYPE: Ai^INO ACID 
(D) TOPOLOGY: LI WEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: -4 1. .-1 

(C) IDENTIFICATIO>J METHOD: Von Heijne matrix 

(D) OTHER INFORt^J\TION: score 5 

seq FCFFPAFLVXVXS/QP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 



Met TMr Pro Lsu Giy Ser Giy Pro Pro Arg Giu Ala Ser lie Ala Gin 

-■10 -35 -30 

Val Arg Giy ?ne Ser Arg Thr ?he Phe Arg Vai Ala Phe Cys Phe Phe 

-25 -20 -15 -10 

Pro Ala Phe Leu Val Xaa val xaa Ser Gin Pro Ser Giy Phe Ser Thr 

-5.1 5 

Thr Giu Thr Leu Cys Ala Gin Asp Phe Ser Asp Val lie Phe Leu Arg 

10 15 20 

Arg Ala Asp Thr Arg Arq Tro Lys Lys Lys Gin Leu Arg Arg Arg 

25 30 35 



(2) i::?ORMATICM FOR SEQ ZD ;:C : 361: 

:i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 
(3; TYPE: A:iINO ACID 

{d: tofolcgv: linear 

:ii) MOLECULE TY^S: ?=OTEIN 

'••'i; c?.:ginal SQ:;r'CE: 

(A: ORGAMIi::: H:i-.o Saoians 



wo 99/06549 PCr/IB98/0t231 

294 

(F) TISSUE TYF£: Tesris 

{IX) rSATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION*: -13 /.-I 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORKATION: score 5 

seq CSALFPLLSLLSC/KE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 



Met Arg Cys Ser Ala Leu Phe Pro 
-15 -10 

Glu Arg Xaa Trp Cys Leu Ser Thr 
5 

His Leu Gly Ser Arg Glu Gin Pro 

20 25 

Vai Trp 
35 



Leu Leu Ser Leu Leu Ser Cys Lys 

-5 : 

Leu Glu Asp Ala Ala Thr Xaa Arg 
10 15 

Ser Gly Asp Ala Glu Pro Val Giu 
30 



(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

Ux) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -2l7.-l 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5 

seq IISLLKLCSFCFI/KD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 



Met Leu Tyr Asp Gin Tyr Tyr Leu He He Ser Leu Leu Lys Leu Cys 
-20 -15 -10 

Ser Phe Cys Phe He Lys Asp Phe Lvs Aia Ser Asn lie Thr Leu Val 
-5 1 ' 5 10 

Val He Leu 



.2) ::rFCRMATION FOR SZQ 10 j:0: 363: 
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(i) S£QUENCE CHARACTERISTICS: 

(A) LENGTH: IS amine acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LIKEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig^pepcide 

(B) LOCATION: -657. -I 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5 

seq LCStLSLRFCTLS/FM 

(xi) SEQUENCE DESCRIPTION: SEQ IP NO: 363: 



Met Ala Asn Cys Phe Leu Ser His 
-65 -60 

Lys Pro Ala Leu Thr Gin Ser His 
-45 

Leu Thr Val Glu Lys Ser His Leu 
-30 

Leu Ser Leu Val Leu Cys Ser Phe 
-15 -10 

Ser Phe Met Cys Ser Phe Ala Leu 
1 5 



Lys Ser Gin Thr lie Leu lie Ser 
-55 -50 

Phe Thr Ser Pro Ala Gly Leu Phe 
-40 -35 

Leu Thr Arg Leu Phe Phe His Trp 
-25 -20 

Leu Ser Leu Arg Phe Cys Thr Leu 
-5 

Phe His Leu 
10 



(2) INFORMATION FOR SEQ ID NO: 364: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NA.ME/KEY: sig_?eptide 

(B) LOCATTCM: -18.. -1 

(C) IDENTIFICATION METHOD: Von Heijn- niatri:< 

(D) OTHER INFORMATION: score 5 

sec LTYLLFLPDWAAV/FE 

(xi) SEQUENCE ::ESCRIPTI0N: SEQ ID NO: 364: 



wo 99/06549 PCTAB9a/OI23 1 

296 

Ket His Giy Ala Giy Leu Thr Tyr Leu Leu Phe Leu Pro Asp Trp Ala 
-15 -10 -5 

Ala Vai Phe Glu Leu Tyr Asn Cys Giu Asp Glu Arg Cys Tyr Leu Asp 
15 10 

Leu Ala Arg Leu Arg Giy Vai His Tyr lie Thr Trp Arg Arq Gin Asn 
15 20 25 30 

Lys Vai Phe Pro Gin Asp Lys Giy His His Pro Thr Leu Giy Giu His 
35 40 45 

Pro Lys Phe Thr Asn Tyr Ser Phe Asp Vai Giu Glu Phe Met Tyr Leu 
50 55 60 

Vai Leu Gin Ala Ala Asp His Vai Leu Gin His Pro Giy 
65 70 75 



(2) INFOFUMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -15.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.9 

seq CLSATLAFSGSFL/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 



Met Cvs Cys Leu Ser Ala Thr Leu Ala Phe Ser Giy Ser Phe Leu Ala 
-15 ' -10 -5 1 

Pro His Leu lie Phe Cys Cys Phe Ser His Leu Asn Vai lie He Leu 
5 10 15 

Leu Ser Ser Leu Ser Pro lie His Giy 
20 25 



(2) rNF0W12VTI0N FOR SEQ ID NO: 366: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4S amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 
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iiir MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ixi FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: -39.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
;D) OTHER INFORMATION: score 4.9 

seq SGLRGLLLQEALG/AV 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 366: 



Met Ala Giu Leu Asp Leu Met Ala Pro Gly Pro Leu Pro Arg Ala Thr 
-35 -30 . -25 

Ala Gin Pro Pro Ala Pro Leu Ser Pro Asp Ser Gly Leu Arg Gly Leu 
-20 -15 -10 

Leu Leu Gin Giu Ala Leu Gly Ala Vai Pro Asp Pro Arg 
-5 15 



(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

tix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -28?. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.9 

seq FLVAC?LFGVCLX/bb' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 



Met Thr Leu Thr His Gly Asn Asn He Leu His Leu Ala Asn ?ne Phe 

-25 -20 -15 

Leu val Aia Cys Pro Leu P.^.e Gly Val Cys Leu Xaa Phe Phe He Leu 

-10 -5 1 

Arg Phe Arg Leu Tyr He Gin Gly Pro Asn Vai Thr Gin Val lie Leu 

i 10 15 20 



-lis Leu Ser Gin Gly Thr Leu Ser 
25 
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(2) INFORMATION FOR SEQ 10 NO: 368: 

(i) SSOUENCS CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE; AMINO ACID 
{DJ TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

CA) NAME/KEY: sig_peptide 

(B) LOCATION: -14.. -1 

CO IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 4.9 

seq VLRWLPWPRGSHS/DS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 



Met Vai Leu Arg Trc Leu Pro Trp Pro Arg Gly Ser His Ser Asp Ser 
-lb -5 1 



(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hono Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptiie 

(B) LOCATION: -24.. -1 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix 

(D) OTHER INFORMATION: i>core 4.8 

seq FSFLGTLFHKSNS/ED 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 



Met Lvs Ala Arg Leu Scr Cly Asn Leu lie Cys Phe Ser ?he Leu Gly 
-20 -15 -10 

Thr Lea Phe His Lvs Ser Asn Ser Clu .^3p Ser Ser Val Gly Lys Gly 
-3 1 S 

Asp Trp Lys I-ys Lys Asn Lys 
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10 - 15 



(2) IMFORMATION FOR SEQ ID NO: 370; 

(i) SEQUENCE CHAFACTERISTICS : 

(A) LEMGTH: 37 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) SAME/KEY: sig^peptide 
(3) LOCATION: -Lo. . -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.8 

seq VCLVPQTPSLCLG/KG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 



Met Ser His Vai Cys Leu Vai Pro Gin Thr Pro Ser Leu Cys Leu Giy 
-15 -10 -5 

Lys Giy Thr Pro Arg Ser Arg Ser Ala Pro Phe Gin Ser Ser Gly Pro 
15 10 15 

His Arg Leu Cys Ala 
20 



(2) INFORMATION FOR SEQ ID NO: 371: 

U? SEQUENCE CHAPACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -29.. -1 

(C) IDENTIFICATION METHOD: Von iisijne matri:-: 
(0) OTHER INFORMATION: score 4.3 

seq VLTSVNLFIGING/SV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC: 371: 
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Met Tyr Pro Ala 3er Phe 
-25 

Val Leu Thr Ser Val Asn 
-10 

Thr Phe Vai Leu Glu Leu 
5 

Asp He Leu Lys Ser Val 
20 25 



300 

Vai Phe Lys lie Pro Ser Thr 
-20 

Leu Phe He Gly lie Asn Giy 



Phe Thr Asp Asn Lys Leu Asn 

10 15 

Phe Leu He Phe Pro His Phe 
30 
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Ala Tvr Val 
-15 

Ser Val Ala 
1 

Asn lie Asn 



Cys Leu Gly 
35 



Arg Gly Gin Thr 



(2) INFORMATION FOR 3EQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -30. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.8 

seq RSSLWVTAPLV3A/C? 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 



Met Ser Ser Ser Arg Lvs Asp His Leu Giy Ala Xaa Ala Gin Ser Pro 
-30 . -25 -20 -15 

Ser Arg Ser Ser Leu Trp Val Thr Ala rTo Leu Val Ser Ala Cys Pro 
-10 -S 1 

Thr Cys Ser Pro Ala Thr His Pro Thr Gly 
5 10 



(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQ'JENCE CHARACTERISTICS: 

(A) LENGTH: 5 6 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) CRIGIMAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
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- ■ iT) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: -19. .-1 

(C) IDENTiriCATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.8 

seq ATYLVQSSACCPA/IV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 



Met Aia Ser Pro Aia Ala Ala Thr ?yr Leu Vai Gin Ser Ser Ala Cys 

-15 -10 -5 

Cys Pro Ala lie Val Arg His Leu Cys Gin Xaa Tvr Arg Ser Met Pro 
1 5 ' 10 

Val Phe Leu Asp Pro Ala Xaa lie Ala Thr Leu Glu Gly lie Ser Trp 
15 20 25 

Arg Leu Pro Ser Ala Pro Ser Asp 

30 35 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 69 amino acids 
(B) TYPE: AMINO ACID 
ID) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

{ix> FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: -6l7.-l 

(CI IDENTIFICATION METHOD: Voti Heijn.e niatrix 
(D) OTHER INFORMATION: szoze 4.7 

seq LLPCNLHKSWLHS/SP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 



Met Asn Ala Ala lie Asn Thr Gly Pro Ala Pro Ala Val Thr Lys Thr 

-60 -55 -50 

Glu Thr Glu Vai Gin Asn Pro .^^sp Val Leu Trp Asp Leu Asp lie Pro 

-15 -40 ' -35 " -30 

Glu Ala Arg 5-sr His Ala Asp Gin Pvsp Ser Asn Pro /.^a Ala Glu Ala 

-25 ' ' -20 -15 



Leu Leu Pro Cys Asn Leu His Xaa S-^r Trp Leu His Ser Ser Frc .^.rg 
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Pro AsD Pro His Ser 
5 



(2) INFORMATION FdR SEO ID MO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

;ix) FEATURE: 

(A) NAME/KEY: sig^pepTiide 

(B) LOCATION: -4 3. .-1 

(C) IDENTIFICATION METHOD: Von Heijr.e matrix 

(D) OTHER INFORMATION: score 4.7 

seq GIFLVIFCSESFS/LL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



Met lie Asn Leu Leu Val Gly Asn Cys lie Tyr Leu Leu Giy Ala lie 
-40 -35 -30 

Arg Ala Ser Cys Met Cys Arg Xaa Met Ser Phe Ala Lys Phe Gly He 
-25 -20 -15 

Phe Leu Val He Phe Cys Ser Giu Ser Phe Ser Leu Leu Leu Tro Asn 
-10 -5 1 ' 5 

Phe Ser Ser lie Tyr Val Lys Thr Phe Trp Pro Val Gly 
10 15 



(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

vA) ORGANISM: Homo Sapiens 
it) TISSUE TYPE: Spleen 

liK) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATIOM: -19.. -1 

{O IDENTIFICATION METHOD: Vd- Hoi^ne matrix 
(D) OTHER IMFORTIATION: score 4.7 

seq L?.?LLRD?CCLLA/Qr 
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(xi) SECUENCE DESCRIPTION: SEQ ID NO: 376: 

«et Leu Cys Cys G-ly Pro Leu Arg Phe Leu Leu Arq Asp Pro Giv Cys 
-15 -10 -S 

Leu Leu Aia Gin Pro Glu Leu Ala Phe Trp Gly Pro Aid Ser ?t\e I.e 
1 5 ' 10 

Ser Giy Gly Leu Val Vai Val Ser Glu Thr Pro His Pro Ser Phe Pro 
15 20 25 

Leu Asp Pro Pro 
30 



(2) IMFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: AMINO ACID 
(D) ?OPOLOG"ir: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: -197.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.7 

seq ILLRMTVLPTLWT/RR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 



Met Arg Lys Thr Ser Phe lie Leu Leu Arg y.cc Thr Val Leu Pro Thr 
-15 -10 -5 

Leu Trp Thr Arg Arg Arg Val Gin Leu Vai 
1 5 



(2) INFORMATION FOR SEQ ID NO: 378: 

;i) SEQUENCE CHAR.2vCTERI5TIC3 : 

(A) LENGTH: 4 1 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TY?Z: PROTEIN 

(VI) ORIGINAL SOURCE: 

(A) ORGAiVISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 
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{!:<)" FEATURE: 

(A) NAME/KEY: siq_pepcide 
(3) LOCATION: -23..-! 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6 ' 

seq VRVGLVLVXRALC/LX 

(XI) SIQCENCE DESCRIPTION: SEQ ID MO: 379: 



Met Trp Trp Lys Pro Ala Pro Glu Glu Gly Val Arg Vai Gly Leu Val 

-20 -15 -10 

Leu Vai Xaa Arg Ala Leu Cys Leu Xaa Vai Leu Ser Arg Phe Met Phe 

-5 1 5 

Xaa Asn Pro Gly Leu Gly Gly Met Gly 
10 15 



(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQL'ENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: AMINO ACID 
fD) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: siy peptide 
(3) LOCATION: -14?. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(Dl OTHER INFORMATION: score 4.G 

seq r>iFLLG>:3SCVYQ/RP 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 379: 



Met Phe Asn Phe Leu Leu Gly Asn Ser Ser Cvs Val Tyr Gin Arg Pro 
-10 -5 * 1 

lie Arg Leu Lys Leu lie lie Phe Pro Ser Gly 
5 10 



(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 119 amino acid3 
(3) TYPE: AMINO ACID 
iO) TOPOLOGY: LINEAR 

(ii) f:01ECULE TY?E: PROTEIN 
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(VL) ORIGINAL SOURCE: 

(A) ORGAN I SN":: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^^pepcide 
(3) LOCATION: "42..-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4,6 

seq LDPAV3LSAPAFA/SA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 



Met Lys Arg Giy Ala Phe Ser Asn Leu Asn Asp Ser Gin Leu Ser Ala 
-40 -35 -30 

Ser Phe Leu Gin Pre Ser Leu Gin Aia Asn. Cys Pro Aia Leu Asp Pro 
-25 -20 -15 

Ala Val Ser Leu Ser Aia Pro Ala Phe Ala Ser Ala Leu Arg Ser Met 
-10 -5 1 5 

Lys Sar Ser Gin Ala Ala Arg Lys Asp Asp Phe Leu Arg Ser Leu Ser 
10. 15 * 20 

Asp Giy Asp Ser Gly Thr Ser Glu His lie Ser Aia Val Val Thr Ser 
25 30 35 

Pro Arg lie Ser Cys His Giy Ala Ala lie Pro Xaa Ala Xaa Aia Xaa 
40 45 50 

Xaa Xaa Gly Cys Ser Cys Xaa Thr Glu Arg Xaa Leu Xaa Xaa Pro Pro 

55 60 65 70 

Ser Lsu Leu Ser Leu Glu Ala 
75 



{2) INFORMATION FOR SEQ ID NO: 381: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(i:.) ^K)LECUL£ TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo SapLari:-, 
C?) TISSUE TYPE: Testis 

{i:<) FEATURE: 

(A) NAME/KEY: sig^peptiCe 
(3) LOCATION: -24.. -1 

fZ) IDENTIFICATION METHOD: /on Heijne matrix 
iD) OTHER :MFCR^L^T^O^': score ^..6 

Jrr FFIFCSLNTLLLG/GV 
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(xir SSQUSNCi DESCRIPTIOM: SEQ ZD NO: 331: 

Met Lys Ser Ala Lys Leu Gly Phe Leu Leu Arg ?he Phe lie Fhe Cys 

-20 -15 -10 

Ser Leu Asn Thr Leu Leu Leu Gly Gly Vai Asn Lys lie Ala Glu Lys 

-5 1 5 

lie Cys Gly Asp Leu Lys Asp Pro Cys Lys Leu Aso Met Asn Phe Gly 

10 15 20 

Ser Cys Tyr Glu Val His Phe Arg Tyr Phe Tyr Asn Arg Thr Ser Lys 

25 30 35 40 

Arg Cys Glu Thr Phe Val Phe Ser Ser Cys Asn Gly Asn Leu Asn Gly 

45 50 55 



(2) INFORMATION FOR SEQ ID NO: 382: 

ID SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: -15.. -1 

(C) IDENTIFICATION METHOD: Vcii Hwijae matrix 

(D) OTHER INFORMATION: score 4.6 

sec IIFPLHSVTGSHP/QC 

(xi) SEQUENCE DESCRIPTION: SEQ TD NO: 382: 



Met .Asp lie Leu Phe Pro Leu His Ser Val lie Giy Ser His Pro Gin 
-15 -10 -5 1 

Cys Leu Pro Glu Arg Xaa Thr Ala Arg Mec He Lys Leu Lys Trp Gly 

5 iO 15 " 

Asn Giy Ser Gly Ser Asp Phe Glv 
20 25 



(2; INFORT^ATION FOR .^"Q TO MO: 33 3- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino ar.^r.-^ 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEP.R 
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(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_pepricle 

(B) LOCATION: -27. .-i 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6 

secj rGILILL5QRQWS/KN 

(Ki) SEQUENCE DESCRIPTION: SZQ ID NO: 383: 



Met Leu Lys Vai Phe Arg Ala Xaa His Pro Lys lie Cys His Phe Gly 
-25 -20 ' -15 

lie Leu He Leu Leu Ser Gin Arg Gin Trp Ser Lys Asn Arg Cvs Arg 
-10 -5 1 ' 5 

Glu Giy Cys Leu Thr Thr Leu Phe Leu Phe Giu Ala Giu His Lys Ser 
10 15 20 

Ser Leu Val Lys 
25 



(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(i:<) FEATURE: 

(A) NAME/KEY: sig^peptid* 

(B) LOCATION: -34. .-1 

(C) IDENTIFICATION METHOD: Von Hsijne matrix 
(0) OTHER INFORMATION: score 4.5 

seq LXWRKLJiASWTLS/QE 

(xi) SEQUENCE DESCRIPTION: SZO. ID :;0: 384: 



Met Leu Val Arg Asn Ala Arg Arg Gly L:er Arc Gly Arg Ser Pro Trp 

-30 ' ->5 -20 

Trp Arg Ala Gly Cys Leu Xaa Trp Ar; lys Le*- Ala Ala Ser Trp Thr 

-15 -10 -5 

L-iu Ger Gin Glu He Phe Arg Gly Ser Arr l/s 31/ Ser 

1 5 ' :: 
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(2) INFORMATION FOR SZQ ID NO: 385: 

(i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

{ii} MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -29.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(D| OTHER INFORMATION: score 4.5 

seq FTLGLGYPIPTRL/Q? 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 385: 4. 

Met Thr Lys Gly His His His Gin His Pro Leu His Pro His Pro Leu 
-2b -20 -15 

Phe Thr Leu Gly Leu Gly Tyr Pro He Pro Thr Arg Leu Gin Pro Cys 
-10 -5 1 

Thr Leu Ser Ser Asp Pro Leu Leu Asp He Thr Cys Ser Leu Arg Ser 

5 10 15 

Pro Ser Ser Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(Ai MAME/KEY: sig peptide 

(B) LOCATION: -237. -1 

{C) IDENTIFICATION METHOD: Von Heijne matrix 
(0} OTHER INFORMATION: score 4.5 

s-sq P.LHILFIVCLARG/KG 

(>:i) SIQrr.Mr?: description: SD?- id :::: 386: 
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Mec Thr Tyr Kis Xaa lie Gin Pi\e Ser Giu Arg Leu H^g lie Leu Phe 

-20 -15 

He Val Cys Leu Ala Arg Gly Lys Gly 

-5 1 



(2; INFORMATION FOR SEQ ID KO: 387: 

(i) SEC^-*s:NCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

«vi) ORIGINAL SOURCE: 

(Ai ORGANISM: Homo Saoiens 
iT) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -46.. -1 

(C) IDENTIFICATION MET.HOD: Von Hsijne matrix 

(D) OTHER INFORMATION: score 4.5 

seq LIYCGLSQPLTLG/VT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 



.Met Ser Gia Phe Pre Leu Cys Ser Pro Pro Trp Lys Pro Leu vai Lys 
-^5 -40 -35 

Val Ser Arg Asn Leu Lys lie Arg Me-: Ser lie Pro Trp Pro Leu Ser 
-30 -25 -20 * -IS 

Val Leu lie Tyr Cys Giy Leu Sei Glii Pro leu Thr Leu Gly Val Thr 
-10 -5 1 

Sor Pro Ser Phe Pro Gin Asn Ser Phe Phe Pro Tip Lea Pro Giu His 
5 10 15 

Pro Thr His Leu Vai Ser Ser Thr Pro Gin 
20 25 



(2) i::rOP.MAT:CM for seq id KO: 393: 

U) SEQVEUCE characteristic::: 

{A: LENGTH: 140 antino acids 
(?.; TYPE: AMINO ACID 
C; TOPOLOGY: LINEAR 

'ii) t<ClEC:JLF. TYPE: PROTEI!! 

.vi) Z}.ZZli:kL SOURCE: 

ORGAMISM: rtorr.o 5;-- • f*ns 
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(xi)_3EQUEMCE DESCRIPTION: 5EQ ID MO: 389: 



Met Vai Leu Thr Thr Leu Pro Leu Pro Ser Aia Asn Ser Pro Vai Asa 

-35 -30 -25 

Kec Pro Thr Thr Giy Pro Asn Ser Lea Ser Tyr Aia Ser Ser Ala Leu 

-20 -15 -10 

Ser Pro Cys Leu Xaa Ala Pro Lvs Ser Pro Arg Leu Gly 
-5 1 ' 5 



(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 ammo acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLLCULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HcTio Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -30T.-i 

(C) IDENTIFICATION METHOD: Voa Heijne matrix 
CD) OTHER INFORMATION: score 4.3 

seq LLPTLPWLPSTRL/LS 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 



Met Gin Arg Asn Ala Thr Phe 11- His Leu Gin Leu Ala He Arg Pro 
-30 -25 -20 -15 

Ser Leu Leu Pro Thr Leu Pro Tru Leu Pro Ser Thr Arg Leu Leu Ser 
-10 ' -5 1 

Pro Thr Pro Leu Gly Gin Leu Arg Giy Pro Pro Gly Xaa Gin Arg Ala 
5 10 ' 15 

Met Pro Thr Ala His Leu Arg 
20 25 



(?.) INFORMATION FOR SEQ ID ::0 : j;U: 

{!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 att^i- ^ acids 
•3) TYPE: AMINO AC 10 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PP.iJT:..:: 

•;vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -15.. -1 

(CI IDENTIFICATIOM METHOD: Von Heijne matrix 
(D) OTHER INFORMATIOM: score 4.3 

seq ILFCFHSrH?LFQ/DT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 



Met Asn lie Leu Phe Cys Phe His Ser Phe His Pro Leu Phe Gin Asp 
-15 -10 -5 1 

Thr lie Glu Phe 
5 



(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amiao acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

{ix) FE.ATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: -20.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
{0} OTHER INFORM.^TI0M: score 4.3 

seq rMFLFLVQ LCI LA/CD 

(xi) SEQUENCE DESCRIPTION: SLQ ID NO: 392: 



Met Leu Thr Asn Arg Asn Tyr Phe Asn Phe Leu Phe Leu Val Gin Leu 
-20 -15 -iO -5 

Cys lie Leu .^J.a Cys Asp Asn Ala Tyr Leu GLn Ser Cys Pro Leu Thr 
1 5 10 

Ser Lys Thr Pro Leu Leu Gin Thr His Ser Ala Leu Phe Tyr Asn Ser 
15 20 25 

Thr Tyr Gly lie Phe Leu Leu Leu Gly Val 
30 35 



(2) IMFORVATICN FOR SEQ ID [10: 39] 
(i) .^7:":7.MCF CHARACTE?:'.-T:r3 
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(A) LENGTH: 58 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LIMEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(AJ ORGANISM: Hcmo Sapiens 
(F) TISSUE TYPE: Testis 

(ijc) FEATURE: 

(A) NAME/KEY: siq_peptide 

(B) LOCATION: -26.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.3 

seq ALCRFVGMQPCTA/QT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 



Met Lys Leu Asn Pro Gly Gin Vai 
-25 -20 

Arg Phe Val Gly Met Gin Pro Cys 
-10 -5 

His Gly Thr His Asn Thr Arg Glu 
10 

Lys Asn Thr Arg Arg Phe Ser Pro 
25 30 



Pro Thr Trp Trp Giu Ala Lau Cys 
— i * 

Thr Ala Gin Thr Gly Leu Leu Pro 
1 5 

Arg Gin Arg Asd Pro Ser Ala Gin 
15 20 

Val Gly 



(2) INFORMATION FOR SEQ ID NO: 394: 

{!) iJEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
(3) TYPE: AMINO ACID 
(Dl TOPOLOGY: L1N=:AH 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: siiy^peptide 
(3) LOCATION: -27. .-1 

(C) IDENTIFICATION METHOD: Von Hei^r.s matrix 

(D) OTHER INFOPa^ATION: score 4.3 

seq LCLNLCPC3SSLL/3? 

(xi) SEQUENCE DESCP.i rTTOri : GZQ ID NO: 394: 



Mei L€:u Ala Gly Phe Arg Aro- ;.;or Ala i?ro Ala Ser Gin Ssr Leu Cys 
-25 -2C -15 



ys Pro Cys 



.Ser leu Leu 5c r Pro Ala 
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;2) INFORMATION ?0R SEQ ID NO: 395: 

(1) sequence: CHARACTERISTICS: 

(A) LENGTH: 25 anino acids 
(3) TYPE; AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -18.. -1 

(C) IDENTIFICATION METHOD: Vcn Heime matrix 

(D) OTHER INFORMATION: score 4.3 

seq SFYLLFFLNDVPP/CP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 



Met Lys Giu Gly Ala Ser Phe Tyr Leu Leu Phe Phe Leu Asn Asp Vai 

-15 -10 -5 

Pro Pro Cys Pro Pro His Thr Pro Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

Ui) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hcn;o Sapiens 
(F) TISSUE. TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_ peptide 

(B) LOCATION: -297.-1 

(C) IDENTIFICATION METHOD: Von Heijr.e nvatrix 

(D) OTHER INFORMATION: score 4.3 

seq ETLLLKLS3QSRT/NR 

(/.I) SEQUENCE DESCRITTICM: SEQ ID NO: 396: 



Met Gly L^a Giu Cys Cys Cvs rro Pro His Asn Leu Ar^ Vai Tyr lie 
-25 ' -20 -i^ 
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Giu Thr Leu Leu Leu Lys Leu Ser Ser Gin Ser Arg Thr Asn Arg Lei: 
-10 -5 1 



(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

{A; LENGTH: 34 amino acids 
(3) TYPE: AMINO ACID 
(3) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapiens 
(r) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: -21..-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.2 

seq VLSIAASLLQCRL/AV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 



Met Gin Leu Cys Pro Phe Thr Ser Val Leu Ser lie Ala Ala Ser Leu 
-20 -15 -10 

Leu Gin Cys Arg Leu Ala Val Val Thr Giu Thr lie Trp Pro Pro Gin 
-5 15 10 

Xaa Trp 



(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii> MOLECULE TYPL: PKOTKIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peotide 

(B) LOCATIOM: -44..-! 

(C) IDEMTiriCATIOM METHOD: Von Heij:\e matrix 

(D) OTHER If:: .^RMATION: score 4.2 

seq QLLFKLN3TVJCRA/LQ 

(xi) SEQUENCE CI3C? T ?TIO^: : SEQ ID NO: 398: 
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Met Asp Val Thr Cys Cys 
-40 

Val Cys Cys His Giy Cys 
-25 

Leu Leu ?he Lys Leu Asn 
-10 

Thr Ser Leu Ala Ser Arg 
5 10 

Glu Phe Phe Thr Val Thr 
25 
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Phe Asp Ala Val Glu Gly 
-35 

Val Ser Trp Leu Cys Leu 
-20 

Ser Thr Trp Cys Arg Ala 
-5 

Arg Leu Trp Met Trp Val 

15 

Pro Trp 



PCT/IB9g/01231 

Ser Asp Phe Arc 
-30 

Gir. Met Leu Gin 
-15 

Leu Gin Ser Glu 
1 

Ser His Leu Xaa 
20 



(2) INFORMATION TOR SEQ ID NO: 399: 

(i) SEQUENCE CHA?ACTERISTICS : 

(A) LENGTH: 26 amino acids 

(B) TYPE: mi^O ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -22. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.2 

seq HCFCFTLFSYSSS/FF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 



Met Arg Gin Giy Pro Gly Ala Pro Leu His Cys Phe Cys Phe Thr Leu 

'-20 -15 -10 

Phe Ser Tyr Ser Ser Ser Phe Phe Phe Phe 
-5 1 



(2) iNFORTiATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHAR.^CTERISTICS : 

(A) LENGTH: 2 3 amino acids 
(3) TYPE: AMINO ACID 
iO) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

ivi) ORIGINAL SOUP.Ci:: 

(A) ORGANISM: lio.-no Sapiens 
{?) TISSUE TVi;:: Tes::iS 
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( ix) 'FEATURE: 

:A) NAME/KEY: slg peptide 
(B) LOCATION: -1^7 -1 

:C) IDENTIFICATION METHOD: Von Keijne matrix 
■:D) 0TH£R INF0R^J\TI0M: score 4.2 

seq ITLLGTWLTXaLQ/PP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 



Met His lie Thr Leu Leu Giy He Trp Leu Thr Xaa Arg Leu Gin Phe 
-15 -10 -5 1 

Pro Arg Ser Giy Arg Ala Giy 
5 



(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 ammo acics 

(B) TYPE: .o^IMO ACID 
(D) TOPOLOGY: LIMEA.R 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -17.. -1 

fC) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 4.2 

seq SWVCLL5AGTAFE/0Y 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: ^01: 



Met Leu Tyr Giy Ser Tro Val Cys Leu Leu ier Ala Giy Thr Ala Phe 
-15 * -10 -5 

Glu Asp Tyr His Leu Giy Giy Thr 
1 5 



(2) INFORRATION FOR SEQ ID NO; 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 50 anmo acids 
(3) TYPE; AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

.[vi) ORIGINAL 50'JRCE: 

{A) ORGAMISX: Hor.z Sapiens 
(F) TISS-JE TYPE: J-erus 
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(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -32.. -1 

(C) TDENTIFICATIOM METHOD: Vor. Heijne matrix 
{D) OTHER INFORMATION: score 4.2 

seq XXXXFLLGRRVVG/ES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 



Met Leu Phe Phe Pro Leu Leu Ser 
-30 -25 

Leu Leu Lys Xaa Xaa Xaa Xaa Phe 
-15 -10 

Giu Ser Xaa Phe lie Phe Thr Cys 
1 5 



Tyr Gly 



Phe Arq Phe Leu Pro Ser Glu Ser 
-20 

Leu Leu Gly Arg Arq Val Val Gly 
-5 

Gly Asn Leu Leu Leu He Trp Pro 
10 15 



(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -16. . -X 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.2 

seq WAILGCWGTLSRG/HL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 



Met Pro Val Trp Ala lie Leu Gly Cys Trp Giv Thr Leu Ser Arg Gly 
-15 -10 -5 

His Leu Pro Val Ser Leu Asp Pro Lys 
1 5 



(2) INFORMATION FOR V)LQ XD NO: 404: 

(i) SEQUENCE CHAPACTERI3TICS : 

(A) LENGTH: 40 anino acids 
(3) TYPE: AMINO ACID 
(D; TOPCr/GV: LINEAR 
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(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix} FEATURE: 

(A) NAME/KEY: siq_pepticle 

(B) LOCATION: -38.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.1 

seq GILCGSLPGPSLC/PP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 



Met Gly Met Ser Gly Lys Lys His Phe Pro Lea Ser Trp Asp His lie 
-35 -30 -25 

Gin Giy Ser Thr Glu Ala Thr Ser Gin Gly lie Lea Cys Gly Ser Leu 
-20 -15 -10 

Pro Gly Pro Ser Leu Cys Pro Pro 
-5 1 



(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOC.-.TION: -37. .-1 

(C) IDENTIFICATION METHOD: Vor. Heijne matrix 

(D) OTHER INFORMATION: score 4 

seq PLSLDCGHSLCRA/CI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 05: 



Met Ala Ser Lys lie Leu Leu Asn Val Gin Glu Glu Vai Thr Cvs Pro 
-35 -30 -25 

lie Cys Leu Glu Leu Leu Thr Giu Pro Leu Ser Leu Aso Cys Gly His 
-20 -15 -10 

Ser Leu Cys Arg Ala Cys He Thr Vai Ser Asn Lyc Ciu Ala Val Thr 

15 10 

Ser :^ec L-ly Gly Ly? oer Ser Cys Pro Vai Cys Giy He Ser Xaa Ser 
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■ 15 20 25 

Xaa Giu His Leu Gin Ala A<?n Gin His Arq 
30 35 



(2; INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -14?. -1 

(C) IDENTIFICATION METHOD: Von Heijne ma!:ri:< 
(0) OTHER INFORMATION: score 4 

seq YYMVCLFFRLIFS/EH 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 



Met Tyr Tyr Met Val Cys Leu Phe Phe Arg Leu lie Phe Ser Giu His 
-10 -5 1 

Leu Pro lie He Giy Thr Val Thr Ser His Lys Thr Gly Thr Gly 
5 10 15 



(2) INFORMATION FOR SEQ ID NO: 407: 

ii) SEQUENCE CHAR.ACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(AJ NAME /KEY: sig_peptide 
(31 LOCATION: -36. .-1 

(C) IDIMTIFICATION METHOD: Von Heijne mavvix 

(D) OTHER INFORMATION: score 4 

seq KLAGLWSPGLVPA/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 
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Met GLy Aia Gly Gly Xaa Arg Glu lie Arg Aia Ala Ala Aia Ser Trp 

-35 -30 -25 

Leu Arg Aia Aia Giu His Ser Lys Leu Aia Gly Leu Trp Ser Pro Gly 

-20 -15 ' -10 -5 

Leu Val Pro Aia Aia Pro Arg Thr Giu Asn Tyr Thr lie Gly Pro Leu 

1 5 10 



(2) INFORMATION FOR SZQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE; 

[A) ORGANISM: Homo Sapiens 
:r) TISSUE TYPE: Testis 

{i>;) FEATURE: 

(A) NAME /KEY: sig^peptide 

(B) LOCATION: -60. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4 

seq LVRRTLLVAALRA/WM 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 



Xet Giy Ser Lys 
-60 

Arg Gin Arg Arg 



Arg Vai Lys Ala 

Vai Arg Arg Thr 
-10 

Cys Trp Trp Arg 
5 



Cys Cys Lys Gly 
-55 

Gin Lys Leu Leu 

-40 

Aia Gly Gin lie 



Leu Leu Vai Ala 
-5 

Thr Leu Val Gin 
10 



Gly Pro Asp Giu 
-50 

Leu Ala Gin Leu 
-35 

Gin A.ia Trp Trp 
-20 

Ala Leu Arg Aia 



Arg Arg lie Arg 

15 



Aso Ala Vai Giu 
-45 

His His Arg Lys 
-30 

Arg Giy Vai Leu 
-15 

Trp Met lie Gin 
1 

Gin Arg Arg Gin 
20 



(2) INFORMATION FOR SEQ ID MO: 409: 

•i) SEQUE'ICe CHAR.ACTERISTICS : 

(A) LENGTH: 30 amino acids 

(3: type:: a^mino acid 

(D; TOt'OLOGY: LINEAR 
MOLECULi TYPE: PROTEIN 

(v.) OPvTr-'ri'-r. SOURCE: 
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-.{A) 0RGANI3M: Hcmo Sapiens 
(?) TJSSUE TYPE: Ovary 

(ix) FEATURE: 

;A) N?^ME/KEY: sig^peptide 

(B) LOCATIOM: -26. .-1 

(C) IDENTiriCATION METHOD: Von Heijne matrix 
{D) OTHER INFORMATION: score 4 

seq SIHSWQLLTSAQP/QQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 



Met Gin Gin Giy His Pro His Leu Ser Aia Gly Thr Leu Ser lie His 
-25 -20 -15 

Ser Trp Gin Leu Leu Thr Ser Ala Gin Pro Gin Gin Aia Gly 
-10 -5 1 



{2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -49. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4 

seq ATCCLSLFQWCAV/LR 

(xi) SEQUE.^JCE DESCRIPTION: SZQ ID MO: 410: 



Met Ser Arg Tyr Giu Xaa Giy Ser Ser Leu Leu Pro Phe Pro Asp His 
-45 -^0 -35 

Phe Ser Vai Tvr Ser Phe Lys Xaa Xaa Ser Phe Phe Glu Ala Tyr Ser 
-30 -25 -20 

lie Ser Asp Tvr Ala Thr Cys Cvs Leu Ser Leu Phe Gin Trp Cys Ala 
-15 ' -ic -5 

Val Leu Arg Phe Leu Ser Leu Pro Leu Pro 
1 5 



(2) INFORMATION FOR SEQ ID NO: 411: 
(i) SEQVE::':E Cr.A?-\CTERISTICS: 
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- (A) LENGTH: 29 amino acids 
(3) TYPE: AMI^^O ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE. TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(B» LOCATION: -24 . .-i 

(C) IDENTIFICATION* METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

seq LLLHHYLLLFITT/SR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 



Met lie Tyr Phe He Lys lie Asa Asn Lvs Leu Leu Leu Leu His His 
-20 -15 -10 



Tvr Leu Leu Leu Phe He Thr Thr Ser Arg Pro Thr Giy 
* -5 15 



(21 INFORMATION FOR SEQ ID NO: 412: 

(13 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovaiy 

(ix) FEATURE: 

(A) NAME/KEY: sig_pepcide 
(3) LOCATION: -27?. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

seq LSWALCLSQSGYY/HP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 



Met Glu Leu Leu Tyr Leu Lys Val Lys Arg Ciy Cln Lys Asp Leu Ser 

-25 -20 -15 

Trp Ala Leu Cvs Leu Ser Zip. Ser Giv Tyr Tyr His Pro Ser His Pro 
-10 -5 1 5 



His Trp 
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(2; INFORMATION FOR 3EQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

iii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^pepride 

(B) LOCATION: -14 . .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

seq TLAVTLSALGATG/LF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 



Met: Thr Leu Ala Val Thr Leu Ser Ala Leu Gly Ala Thr Gly Leu Phe 
-10 -5 1 

Lys Giu Ala Cys Asp Leu Thr Phe Leu Asn He Gly Gin He Thr Ser 
5 10 15 

Xaa Leu Lys Gin Ser Gly Gly Pro Gin 
20 25 



(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

iA) ORGANISM: Homo Sapiens 
{?) TISSUE TYPE: Uterus 

(i:<) FEATURE: 

(A) NAME/KEY: aig_peptide 
(3) LOCATION: -41..-i 

(Ci IDENTIFICATION METHOD: Von Heijne matrix 
rJ) OTHER INFORMATION: score 3.9 

seq CRCLITLPRSCRP/ST 

{xi) 3EvUENC£ DESCRIPTION: SEQ ID NO: 414: 



Met Leu Giy ?ro Pro Leu Gin Pro Gly Ser His Gly Lys Val Leu Ala 
-40 ' -35 -30 
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-25 - -20 -15 -10 

ri«a Thr Leu Pro Arg Ser Cvs Arg Fro Ser Thr Ser Vai Pro Arg Xaa 
-5 ^ 1 5 

AlA Ser Thr Arg Ser Sar Gin Arg Pro Xaa Ser Ser Cys Trp Arg Ser 
10 15 20 

Ser Cys Ser Thr Thr Ala T:\r Met 
25 30 



(2) IMFORMATION FOR SEQ ID NO: 415: 

{i) SEQUENCS CHARACTERISTICS: 

(A) LENGTH: 4 8 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sigj)eptide 
(3) LOCATION: -27. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.8 

seq QLXLILVHFPAYS/VE 

(jci) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 

Wet Giy Asn Vai Cys Ser Cys Cvs Leu Arg Ala Arg Tyr Gin Gin Leu 
-25 -20 -15 ■ 

Xaa Leu lie Leu Vai His Phe Pro Ala Tyr Ser Vai Glu Asp Gin Arg 
-10, -5 1 5 

Vai Asp Pro Glv Vai Pro Giy Glu Ser Thr Vai Cys His His Asn Arg 
10 15 20 



(2) INFORI-LATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 
•2) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii: --iClECULE TYPE: PROTEIN 

(vil 0:-MGIMAL SOURCE: 

a; CriGANISM: Homo Sapiens 
r; TISSUE TYPE: Spleen 
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-■{A) NAME/KEY: sig_peptide 
{3} LOCATION: -21. .-1 

(C) IDENTIFICATION METHOD: Von Hftijnp ma'ri-< 

(D) OTHER INFORMATION: score 3.3 

seq PRCVISCIHGVWC/EE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 

Met Leu Tyr Gly Leu Gly Ser Gly Pro Arg Cys Val He Ser Cys He 
-20 "15 -10 

His G»y Val Trp Cys Glu Glu Gly Asp Gly Ser Leu Pro Arg Leu His 
-5 1 5 iC 

Val Ala Leu Met He Pro Ala Leu Gly 
15 20 



(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 
(3} TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -42.. -1 

(C) IDENTIFICATION METHOU: Von Hei^ne r.atrix 

(D) OTHER INFORMATION: score 3.8 

seq VTPLDSCPPSAHS/A? 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

Met His Arg He Met Thr Leu Leu His Leu Lys Aia Leu Gin Gin Leu 
-40 -35 -30 

Gin Asn Lys He His Val Pro Arg Met Leu Pro Gly Pro Val Thr Pro 
-25 -20 -15 

Leu Aso Ser Cvs Pro Pro Ser Ala His Ser Aia Pro Ser Leu Leu Thr 
-10 " -5 1 ^ 

Ser Gin Leu Pro Leu Gin His Thr Asn Ala Pro Pro Pro His Gly Leu 
10 15 20 

=«r Leu Arg Arg Ala Leu His Trp He Ala Leu Pro Leu Met Giy 
25 30 35 



ro?. SEQ ID NG: 418: 
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(i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 19 aniino acids 
(3} TYPE: AMINO ACID 
(05 TOPOLOGY: LINEAR 

(ii) NOLECULE TYPE: PROTEIN 

(vil ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

{A) NAME/KEY: sig^peptide 
(3) LOCATION: -13. . -i 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.8 

seq MLFLVLFYSAIFL/FT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 



Met Leu Phe Leu Val Leu Phe Tyr Ser Ala He Phe Leu Phe Thr Leu 
-10 -5 1 

Thr Phe Phe 
5 



(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
(3) TYPE: AMINO ACID 
{D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

FEATURE: 
(A) NAME/KEY: sig^peptide 
(3) LOCATION: -147.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.3 

scq VSLCVAALFPLQA/YC 

{•si) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 



Met Vai Ser Leu Cys Vai Ala Ala Leu Phe Pro Leu Gin Ala Tyr Gly 
-10 -5 1 



(2) INFORMATIOM FOR SEQ ID NO: 420: 

vi: siove::ce characteristics: 
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(2) INrORHATION FOR SEQ ID MO: 422: 

(i) SEQUENCE CHAEIACTERISTICS : 

■:A) LENGTH: 31 amine acids 
•:9) TYPE: AMIKO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -14.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.7 

seq MLPLFCSPWESGG/RT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 



Met Met Leu Pro Leu Phe Cys Ser Pro Trp Giu Ser Gly Gly Arg Thr i* 
-10 -5 1 

Vai Lys Gin Ser Giu Gly Xaa Cys Xaa Phe Gin Ala Pro His Gly 
5 10 15 



(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(i:<) FEATURE: 

(A) NAME/KEY: sigpeptide 

(B) LOCATION: -15 . .-1 

(C) IDENTIFICATION METHOD: Von Heijae matrix 

(D) OTHER INFORMATION: score 3.7 

seq KLLSDLSVDSARC/KP 

SEQUENCE DESCRIPTION: SEQ ID MO: 423: 



Met .-.1- '-y3 Leu Leu Ser .^sp Leu Ser Val Asp Ser Ala Arg Cys Lys 
-15 -10 -5 1 



Pro Civ ri .-.sn Leu Thr Lys Ser Leu Leu Asn lie His Asp Lys Gin 
5 10 15 
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Leu Gla His Asp Pro Arg 



{2) TNTORMATTON FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: AMINO ACID 
JD) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi} ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -18.. -1 

(C> IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION; score 3.7 

seq VCWGHLLPARVST/RS 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 424: 



Met Cys Giy Tyr Trp Val Cys Trp Giy His Leu Leu Pro Ala Arg Val 
-15 -10 -5 

Ser Thr Arg Ser Ser Glu Gin Pro Arg Val Thr Pro Arg Asp Glu Asp 

15 10 

Ala Met Met Ser Ala Ser Leu Leu Thr Tr? Arg Tyr Val Thr Phe Met 
15 20 25 30 

Val Pro Met Pro Leu Ser Pro Cys Arg Ser Val Trp Val Cys Phe Arg 
35 40 45 

Gin Lys lie Leu Glu Tyr Val Xaa Ala 
50 55 



(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A.) LENGTH: 34 aiQlno acids 
(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 



U;<1 FEATURE: 

(A) NAME/KEY: sig_peptide 
{3} LOCATION: -24 . . -1 
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(C) IDENTIFICATION METHOD: Von Heijne matrix 
"(D) OTHER INFORKATIOM: score 3.7 

seq AILGISTTLNLLS/IN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: <25: 



Met Lvs Leu Ser Cys Ala Gly Cys Ala Asp Thr Ala lie Leu Gly Leu 
-20 -15 -10 

Ser Thr Phe Leu Asn Leu Leu Ser lie Asn Leu Leu Gly Met lie Ser 
-5 1 5 

Phe Ser 
10 



(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix> FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -21.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.7 

seq FSLGSCP.AGPLSA/CV 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 4 26: 



Met lie* Pro Phe Ser Gly Thr Vai Phe Ser Leu Gly Ser Cys Pro Ala 
-20 -15 -10 

Gly Pro Leu Ser Ala Cys Val Pro Asp His Gly Ser Leu Gin Tyr Pro 

-5 1 5 10 

Leu Thr He Tyr Gin Gin Asp Cys Xaa Thr His Xaa Cys Pro krq Cys 
15 20 25 

Leu Ser Leu Pro Leu Gin His Pro Arg Gin 
30 35 



(2) INFORMATION FOR SEQ ID NO: 427: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOT.OGY: T.TN=:AR 
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(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sigjeptide 
(3) LOCATION: -35.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.7 

seq PAVSLSAP.AFASA/LR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 



Met lie Pro Ser Ser Gin Pro Arg Phe Xaa Asn Pro Ala Cys Lys Gin 
-35 -30 -25 -20 

Thr Val Leu Leu Xaa Asp Pro Ala Val Ser Leu Ser Ala Pro Ala Phe 
-15 -10 -5 

Ala Ser Ala Leu Arg Ser Met Xaa Ser Ser Gin .Ala Ala Arg Lys Asp 
15 10 

Asp Phe Leu Arg Ser Leu Ser Asp Gly Asp Ser Gly Thr Ser Glu His 
15 20 25 

He Ser Ala Val Val Thr Ser Pro Arg He Ser Cys His Gly Ala Ala 
30 35 40 45 

He Pro Thr Ala Arg Ala Leu Cys Leu Xaa Cys Ser Cys Cys Thr Glu 
50 55 60 



Arg 



(2> IMFORM.ATION FOR SEQ ID NO: 428: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
(3) TYPE: AMINO ACID 
ID) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
:5} LOCATION: -15.. -1 

'Z) IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 3.6 

seq ?TFwLISDSFLTS/Q? 

(/.il SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
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Met Aia Pro Thr Phe Leu Leu lie Ser Asp Ser ?he Leu Thr S*r Glrs 
-15 -10 -5 I 

Pre Ser Phe Phe Phe Phe 



(2) INFORMATION FOR SZQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/5CEY: sig peptide 

(B) LOCATION: -197.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.6 

seq LSLLGIKIQWCLS/EN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 



Met He Ser Leu He Vai Leu Ser Leu Leu Gly He Lys He Gin Trp 
-15 -10 -5 

Cys Leu Ser Giu Asn Thr Leu Phe Cys Asp Ser Asp Tyr Leu Lea Ser 
1 5 10 

Pro Lys Ala Pre He Giu Pro Leu Ser Phe Asn Leu Thr Thr Gin Gly 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi» ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(i:<) FEATURE: 

(A) NAME/KEY: sig_peptide 
i3) LOCATION: -4 3..-'l 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
other INFOP^MATION: score 3.5 
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sec l:.yfntflprkva/rv 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 



Met Aia Cys Asp Ser Phe Leu Lys Asp Ala Leu Pro Gin Giu Leu Ser 

-40 -35 -30 

Gin Lea Xaa Phe Leu Phe Pro Leu Val Asp Met Arg Glu Asp Leu Leu 

-25 -20 -15 

Tyr Phe Asn Thr Phe Leu Pro Arg Lys Val Ala Arg Vai 

-10 -5 1 



(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig_pepcide 

(B) LOCATION: -53.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.6 

seq FLILHFFPQQIRK/KI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 



Met Leu Leu Leu Asn Glu Asn Leu Lys Ala Glu lie Gin Lys Asn Glu 
-50 -45 -40 

Ala Gin Gly Ser Cys lie Leu Phe Leu Phe Cys Phe Glu Ser Gin Asn 
-35 -30 -25 

Met Arg Ser Lys Ser lie Phe Pro Phe Leu lie Leu His Phe Phe Pro 
-20 -15 -10 

Cln Cln lie Arg Lys Lys lie Val Val Leu Leu Leu Gly Leu Asn Ser 
-5 1 5 10 

Gin Lys Ala Gly 
15 



[2) INFORMATION FOR SEQ ID NO: 4 32: 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 amino acids 
(3) TYPE: Mim ACID 
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*- (D) TOPOLOGY: LIMEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hcmo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: -267.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LLPFTFLSLKAFL/QX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 



Met lie Ser Lys Tyr Val His Tyr Ser Leu Tlir Asp Leu Leu Leu Pro 
-25 -20 -15 

Phe Thr Phe Leu Ser Leu Lys Ala Phe Leu Gin Xaa Arg Val Leu Met 
-10 -5 15 

Ser Leu Pro Gin His Lys Pro Trp 
10 



(2) INFORMATION FOR SEQ ID NO: 433: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -277.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
. (D) OTHER INFORMATION: score 3.5 

seq CSLLSSFCALHFG/LK 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 



Met .Ala Arg Thr Met Gly Val Pro Arg Ala Cys Lys Ala Phe Cys Ser 

-25 -20 -15 

Leu Leu Ser Ser Phe Cys Ala Leu His ?r\e Gly Leu Lys Lys Gin Tyr 

-10 -5 1 5 

Gly Thr Ser Tyr Leu His Ala Cys Ala Tyr Ala Ser Pro Leu Thr Trp 

10 15 20 
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(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -15.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LCFLLPHHRLQEA/RX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 



Met lie Leu Cys Phe Leu Leu Pro His His Arg Leu Gin Glu Ala Arg 
-15 -10 -5 1 

Xaa lie Gin Val Leu Lys Xaa Leu Pro Arg Giu Lys Leu 
5 10 



(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 
(3) TYPE; AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: 3ig_peptide 

(B) LOCATION: -26.. -1 

(C) IDENTIFICATION METHOD: Vor. Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq QCFFVCFSPKIYG/VI 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 4 35: 



Met Gin Asp Tyr Val Ser His Ala Val Arg Arg His Cys Gin Cys Phe 
-25 -20 rl5 
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?he Val Cys- Phe Ser Pro Lys lie Tyr Giy 
-10 -5 



Leu lie Thr Gly Ala Arg Val Leu Ser Glu 
10 15 

Arg Leu Aso Asp lie Thr Ala Asn Ala Ala 
25 30 

Glu Pro Arg Lys Thr Phe Glu Asn Asn Trp 
40 45 



Trp Asn Trp 
55 



PCT/IB98/0t231 

Val He Thr Trp Thr Val 

1 5 

Pro Gin Arg Leu Trp Val 
20 

Cys Gly Tyr Arg Lys Gin 
35 

Glu Asn Leu Tyr Thr Asp 
50 



(2) INFORMATION FOR SEQ ID NO: 4 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -31.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq VLLNLALSHFNNC/GL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 



Met Glu Phe Ala His Ala Ala Glu Cys Vai Ser Phe Ala Leu Asn Glu 
-30 ■ -25 -20 

Thr His Val Leu Leu Asn Leu Ala Leu Ser His Phe Asn Asn Cys Gly 
-15 -10 -5 1 

Leu Ala Val 



(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 amino acids 

(B) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
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..(Fl TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3} EOCATION: -30. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
O) OTHER INFORMATION: score 3.5 

seq LLAASWLPROAPC/EA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 



Met Giy Asn Gin Giy Phe Pro Tyr Leu Ser Pro Ser Leu Ser Val Gin 

-30 -25 -20 -15 

Asp Leu Leu Ala Ala Ser Trp Leu Pro Arg Asp Ala Pro Cys Giu Aia 

-10 -5 1 

Pro Pro Giy Leu Pro Ser Gin Thr Met Leu Cys Ala Pro Giy Pro Arg 

5 10 15 



(2) INFORMATION FOR SEQ ID NO: 4 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -227. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq AQLASPLLPGATP/VA 

(xi) SEQUENCE DESCRIPTION: SEQ , ID NO: 438: 



Met Lys Tyr Gin Met Val Ser Giy 
-20 -15 

Leu Pro Giy Ala Thr Pro Val Ala 
-5 1 

Leu Arg Thr Val Lys Met Met Arg 
15 



Ser Ala Gin Leu Ala Ser Pro Leu 
-10 

Giy Thr lie Leu Lys Ser Leu Leu 
5 10 

Val Tyr Giy 
20 



;2) INrOR>tATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 
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. . (B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -2x7.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 12.7 

seq ILFLLSWSGPLQG/QQ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 



Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 
-20 -15 -10 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 
-5 1 5 10 



Arg His Gly 



(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
CB) LOCATION: -20T.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.4 

seq LVFCVGLLTMAKA/ES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 



Met Ala Ser Leu Gly His lie Leu Val Phe Cys Val Gly Leu Leu Ttir 
-20 -15 -10 -5 

Met Ala Lys Ala Glu Ser Pro Lys Glu His Asp Pro Arg 
1 5 



i2) INFORMATION FOR SEQ ID NO: 441: 
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{!) SEQUENCE CHARACTERISTICS: 

;A) LENGTH: 47 amino acids 
{3) TYPE: AMINO ACID 
{D) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: PROTEIN 

:yi} ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(LX) TEATURE: 

(A) NAME/KEY: sig_^peptide 
(B} LOCATION: -23.. -i 

(C) IDENTIFICATION METHOD: Voa Heijne matrix 
(D> OTHER INFORMATION: score 7.3 

seq ALSLLLVSGSLLP/GP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 



Met Ser Gly Ser Ser Leu Pro Ser Aia Leu Ala Leu Ser Leu Leu Leu 
-20 -15 -10 

Val Ser Gly Ser Leu Leu Pro Giy Pro Gly Ala Aia Gin Aan Glu Pro 
-5 1 5 

Arg He Val Thr Ser Glu Glu Val He He Arg Asp Ser Pro Val 
10 15 20 



(2) INFORMATION FOR SEQ ID NO: 442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

_(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 
(3) LOCATION: -577. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORRATION: score 7.1 

seq VGLAWSLGGSRG/SG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442: 



Met Met Glu Val Val Val Gly Asn Giy Val Val Ala Leu Arg Gly lie 

-55 -50 -45 

Pro ?rc Ara Thr Ser .Arg Lys Ser Ser Arg Lys Thr Arg Phe Cys Giy 

.40 ' -35 -30 
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3Iu Arg Giy Ser Lys Gin 3er Gly Lys Cys Ser Pro Vai Giy Leu Ala 

-25 -20 -15 -10 

Vai Vdl Ser Leu Giy Giy Ser Arg Giy Ser Giy I-ys Giy Leu Giy Arg 

-5 1 5 



Leu 



(2) INTORMATION FOR SEQ ID NO: 44 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE; AMINO ACID 
CD) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/ KEY: sig_peptide 

(B) LOCATION: -16. .-1 

iC) IDENTIFICATION METHOD: Vcn Heijne matrix 
(D) OTHER INFORMATION: score 6.7 

seq CFSLVLLLTSIWT/TR 

(xil SEQUENCE DESCRIPTION: SEQ ID NO: 443: 



Met Ala Arg Cys Ptie 
-15 

Thr Arg Leu Leu Vai 

1 5 

Gin Vai Ser Cys Arg 
20 

Asn Gin Gin Leu Asn 
35 



Ser Leu Vai Leu Leu Leu 
-10 

Gin Giy Ser Leu Arg Ala 
10 

Kaa Met Giy He Thr Leu 

25 

Phe Thr Glu Ala Lys 
40 



Thr Ser He Trp Thr 
-5 

Giu Glu Leu Ser He 
15 

Vai Ser Lys Lys Ala 
30 



(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 
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(ix) -FEATURE: 

(A) NAME/KEV: sig peptide 
(3) LOCATION: -2l7.-l 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORiyiATION: score 5.6 

seq CLSCLLIPLALWS/II 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 



Met Gly Ser Arg Lys Cys Gly Gly Cys Leu Ser Cys Leu Leu He Pro 
-20 -15 -10 

Leu Ala Leu Trp Ser He He Val Asn He Leu Leu Tyr Phe Pro Asn 
-5 1 S 10 

Gly Gin Thr Ser Tyr Ala Ser Ser Asn Lys Leu Thr Asn Tyr Vai Trp 
15 20 25 

Tyr Phe Glu Gly He Cys Phe Ser Gly He Met Met Leu He Val Thr 
30 35 40 

Thr Val Leu Leu Val Leu Glu Asn Asn Asn Asn Tyr Lys Cys Cvs Gin 
45 50 55 

Ser Glu Asn Cys Ser Lys Lys Tyr Val Thr Leu Leu Ser He He Phe 
60 65 70 75 

Ser Ser Leu Gly He Ala Phe Ser Gly Tyr Cys Leu Val He Ser Ala 
80 85 90 

Leu Gly Leu Val Gin Gly Pro Tyr Cys Arg Thr Leu Asp Gly Tro Glu 
95 100 105 

Tyr Ala Phe Glu Gly Thr Ala Gly 
110 115 



(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: AMINO ACID 
{D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -2lT.-l 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.6 

seq CLSCLLIPLALWS/II 

(xi) SEQUENCE DESCRIPTION: SEQ ID MC: 445: 
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Met Giy Ser Arg Lys Cys Giy Giy 
-20 -15 

Leu Ala Leu Trp Ser lie lie Vai 
-5 1 

Giy Gin Thr Ser Tyr Ala Ser Ser 

15 

Tyr Phe Giu Giy lie Cys Phe Ser 
30 35 

Thr Vai Leu Leu Vai Leu Giu Asn 
45 50 

Ser Giy 
60 



Cys "Leu Ser Cys Leu Leu lie Pro 

-ic 

Asn lie Leu Leu Tyr Phe Pro Asn 
5 10 



Asn Lys Leu Thr Asn Tyr Vai Trp 
20 25 



Giy lie Met Met Leu lie Vai Thr 
40 



Asn Asn Asn Tyr Lys Cys Cys Gin 
55 



(2) INFORMATION FOR SEQ ID NO: 44 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(i:<) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -17. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.3 

seq ILFGVSFVFLTHC/TI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 



Met Met Vai Met He Leu Phe Giy Vai Ser Phe Vai Phe Leu Thr His 
-15 -10 -5 



Cys Thr lie Gin Ser Ser Cys Giy 
1 5 



(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii? MOLECULE TYPE: PROTEIN 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

<ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -19.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(0) OTHER INFORMATION: score 4.4 

seq VLVSLPKPHPALT/CC 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 



Met Ser Asn Thr His Thr Val Leu Val Ser Leu Pro His Pro His Pro 
-15 -10 -5 

Ala Leu Thr Cys Cys His Leu Gly Xaa Pro His Pro Val Arg Ala Pro 
1 5 10 

Arg Pro 

15 



(2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -106. -rl 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.1 

seq IITLACVPMTSFT/RN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 48: 



Met Xaa Vai Tyr Arg Leu Gin Thr Gin Glu Lys Pro Asn Thr Thr Val 

-105 -100 -95 

Gin Vai Pro Ala Phe Leu Gin Glu Leu Val Asp Arg Asp Asn Ser Lys 
-90 -85 -80 -75 

Phe Glu Giu Trp Cys lie Giu Met Ala Glu Met Arg Xaa Lys Val Trp 
-70 -65 -60 

lis Lys Giu Lys Gin Asn Thr Lys Arg Leu .^rg Scr Cys Thr Lys Glv 
-55 -50 -45 

y 

Tyr L3u Lau Giu Leu Sar Pro Met Ser Leu Ser Leu Trp Asn Gly Cys 
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-40 -35 -30 

Lys Ser Gly Trp Met Asn Gin Gin Xaa Pro Asn Leu Leu lie lie Thr 
-25 -20 -15 

Leu Ala Cys Val Pro Met Thr Ser Phe Thr Arq Asn Lys lie Ser lie 
-IC -5 1 5 

Men Lvs Arg lie Ser Giu Tyr Ala Aia Asp lie Phe Tyr 
10 15 



(2) INFORMATION TOR SEQ ID NO: 44 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii: MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -21, .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

seq LIAWIIILLIFT/SV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 



Met Ptie Pro Vai Leu Gly Trp lie Leu lie Ala Val Val lie lie lie 

-20 -15 -10 

Leu Leu lie Phe Thr Ser Vai Thr Arg Cys Leu 

-5 1 5 



(2) INFORMATION FOR SEQ ID NO: 450: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
Cr) TISSUE TYPE: Testis 

(i:-:' FE.-.TURE: 

(A) NAME /KEY: sig_peptide 
(3) LOCATION: -19.. -1 

CO IDENTIFICATION METHOD: Von Heijne matrix 
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:0) OTKI?. INTORMATION: score 3.9 

seq SVC1CPCLMKGQS/£K 

(:<i:. SEQUENCE DESCRIPTION: SEQ ID NO: 450: 



i-!et ?he Ser Cys Cys lie Ser Val Cys Leu Cys Pro Cys Leu Asn Lys 
-15 -iO ^ -5 

Gly Gin Ser Glu Asn Leu Ser Arg Asp Cys Gly His Trp Leu Asn Pro 

1 5 lb 

His His Arq Arg Leu Trp Pro Phe Gly Arg Arg His Pro Gin Asp Cys 
15 20 25 

Gly Leu Phe Gin Asp Ser Gin Xaa Tyr Gly Glu Ser Lys Asp Trp Asn 
30 35 40 45 



Gly 



(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION; -29.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LTYLLLLSPIKYP/LD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 



Met Arg Leu Cys Leu lie Met Tyr Cys Ser Phe Gly Thr Leu Ser His 
-25 -20 -15 

Leu Thr Tyr Leu Leu Leu Leu Ser Pro lie Lys Tyr Pro Leu Asp Lou 
-10 -5 1 

Asp Phe Leu Tyr Pro He Phe Ser Thr Vai Tyr Lys Arg Tyr He Val 
5 10 15 

Thr val Asn Phe Cvs He Ser Cys Ser Glu Ser Phe Leu Leu Ser Asp 
20 ' 25 30 35 

Leu Tie Ala Leu Phe Leu He Arg Glu Leu Zir. Leu Leu Gin His Thr 

40 45 SD 

Val Ser Vai Vai Gin Pro Pro Thr 
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(2' INFORMATION FOR SEQ ID NO: 452: 

(i) SECUENCE CHAFACTERISTICS : 

(A) LENGTH: 4 7 amino acids 
(3) TVPE: AMINO ACID 
(C) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Home Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NA>IE/KEY: sig_pepcide 
(B> LOCATION: -26. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 10.5 

seq LLLALLLPVQVS3/FV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 



Met Giy Lys Giy Met Vai Ala Met Leu He Leu Gly Leu Leu Leu Leu 
-25 -20 -15 



.'^la Leu Leu Leu Pro Vai Gin Vai Ser Ser Ptie Val Pro Leu Thr Ser 
-10 -5 1 5 



Met Pro Glu Ala Thr Ala Ala Giu Thr Thr Lys Pro Ser Asn Giy 
10 15 20 



(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -2l7.-l 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 9.5 

seq LLVLFVLLANVQG/PG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 
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Mer Giy Ser 3er 3iy Leu Lau Ser Leu Leu Vai Leu ?he Vai Leu Leu 
-20 -15 -10 

Aid AsQ Vai Gin Giy Pro Giy Leu Thr Asp Tro Leu ?he Pro Arg Arg 
-5 1 5 ' 10 

Cvs Pro Lys He Arg Giu Glu Cys Glu Phe Gin Glu Arg Asp Vai Cys 
15 20 25 



Thr Lys Asp Arg Gin Cys Arg 
30 



(2; INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CK.ARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_pepticie 
(3) LOCATION: -357.-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.3 

see NLLLLHCVSRSHS/QN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454: 



Met Vai Leu Giy Giy Cys Pro Vai Ser Tyr Leu Leu Leu Cys Giy Gin 

-35 -30 -25 -20 

Ala Aia Leu Leu Leu Giy Asn Leu Leu Leu Leu His Cys Vai Ser Arg 

-15 -10 -5 

Ser His Ser Gin Asn Aia Thr Ala Glu Pro Giu Leu Thr Ser Ala Giy 

15 10 

Aia Pro Ser Arg Arg Ala Pro Giy Vai Leu Arg Ala Giy Asn Met Ala 

15 20 25 

Thr Pro Thr Leu Arg Ser Ser Ser Aia Leu Thr Tyr Leu Giy 

30 35 40 



:2) INFORMATION FOR SEQ ID NO: 4 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 
(3) TYPE: a.MINO ACID 
(D) TOPOLOGY: LINEAR 
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(ii) .MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
it) TISSUE TYPE: Testis 

(IX) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -18.. -i 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.3 

seq LLSLSSLPLVLLG/WE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 55: 



Met Glu Thr Gly Arg Leu Leu Ser Leu Ser Ser Leu Pro Leu Val Leu 

-15 -10 -5 

Leu Gly Trp Glu Tyr Ser Ser Gin Tnr Leu Asn Leu Val Pro Ser Thr 

1 5 10 

Ser lie Leu Ser Phe Vai Pro Phe He Pro Arg Val 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 4 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -19. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8,2 

seq QVLALVLVAALWG/GT 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 56: 



Met Ala Ala Ser Leu Gly Gin Val Leu Ala Leu Val Leu Val Ala Ala 
-15 -10 -5 

Leu Trp Gly Gly Thr Gin Pro Leu Leu Lys Arg Ala Ser Ala Gly Leu 
IS 10 

Gin Arg Vai His Glu Pro Thr Trp Ala Gin Gin Leu Leu Gin Giu Met 
15 20 ' 25 

Lys Thr Leu Phe Leu Asn Thr Glu Tyr Leu Met 
30 35 40 
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(2) INFORMATION FOR SZQ ID NO: 4 57: 

(i^ sequence' CHARACTERISTICS: 

:?.} LENGTH: 84 amino acids 
:a) TVPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi} ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -59..-1 

(C; IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 8.1 

seq FLLGISNLSQVRA/SN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 



Met His lie Lvs Ser He lie Leu Glu Gly Phe Lys Ser Tyr Ala Gin 
-55 -50 -45 

Arg Thr Glu Val Asn Gly Phe Asp Pro Leu Phe Asn Ala He Thr Gly 

-40 -35 -30 

Leu Asn Gly Ser Gly Lys Ser Asn He Leu Asp Ser lie Cys Phe Leu 
-25 -20 -15 

Leu Gly He Ser Asn Leu Ser Gin Val Arg Ala Ser Asn Leu Gin Asp 
-10 -5 15 

Leu Val Tyr Lys Asn Gly Gin Ala Gly He Thr Lys Ala Ser Val Ser 
10 15 20- 



lie Xaa Phe Asp 
25 



(2) INFORMATION FOR SEQ ID NO: 4 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 
{3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapiens 
•:F) TISSUE TYPE: Testis 

(ixl FEATURE: 

{A) NAME/KEY: sig^peptide 
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- (3» LOCATIObi: -19. .-1 

(Ci ID2NTIFICATI0N METHOD: Vcn Heijns ir.atrix 
(D) OTHER INFORMATIO!^: score 8 

seq WGrLCVLFTAVH?/AP 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 



Met Ser Pre Ser FrD Arg Trp Giy Phe Leu Cys Vai Leu Phe Ihr Aia 
-15 -10 -5 

Val His Pro Ala Pro Ser Thr Ala Pro Val Gin Asp Lys Cys Fro Val 
1 5 10 

Asn Thr Tro Giu Aia Met Xaa Xaa Val Leu Pro Ala Ala Pro Ala Asn 
15 20 25 

Arg Pro Pro Thr Gin Ala Phe Pro Ser Ala Ser Thr Ala Thr Giy 
30 35 40 



(2) INFORM^iTION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(i)c) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: -24 . .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.8 

seq FLLCLC:AYWA3T/AV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459: 



Met Cys Ser Leu Leu Tyr Pro Leu Val Thr Phe Phe Leu Leu Cys Leu 
-20 -15 -10 

Cys lie Ala Tyr Trp Ala Ser Thr Ala Val Phe Leu Ser Thr Ser Asn 
-5 15 

Glu Ala Val Tyr Lys lie Phe Asp Asp Ser Pro Cys Pro Phe Thr Ala 
10 15 ' 20 

Lys Thr Cys Asn Pro Giu Thr Phe Pro Ser Ser Asn Glu Pro Arg His 
25 ' 30 35 40 

Glv 
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1 2) INFORMATION FOR SZQ ID N*0: 4 60: 

(i) SEQUENCE CHAPACTERI3TICS : 

•:A) LENGTH: 51 amino acids 

(E) TYPE: AMINO ACID 
•:D) TOPOLOGY: LINEAR 

(ii^ MOLECULE TYPE: PROTEIN 

(vi: ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Spleen 

I'ix) FEATURE: 

(A) NA>I£/KEY: sig_peptide 

(B) LOCATION: -16 /.-l 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORt^ATION: score 7.6 

seq cLFFSTLFSSI?T/EA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 

Met Leu Pro Phe Lea Pne Phe Ser Thr Leu Pne Ser Ser lie Phe Thr 
-15 -10 -5 

Glu Ala Gin Lys Gin Tyr Trp Vai Cys Asn Ser Ser Aso Ala Ser lie 
1 5 10 ' 15 

His Thr Pro Thr Val lie Lys Cys Asn Thr Gin Phe Gin Leu Met Leu 
20 25 30 

Thr Pro Gly 
35 



(2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 



(ix) FEATURE: 

(A) MAiME/KEY: 3ig_p2p-ide 

(B) LOCATION: -14..-! 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
{D) OTHER INFORMATION: score 7.5 

seq VALNLILVPCCA.A/WC 

{:<i) SEQ^JEHCE DESCRIPTION: SEC ID MO: 4 61: 



Xet val Ala l5u Asn Leu He Lau v^i Pro Cvs Cys Ala Ala Trp Cys 
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-10 

Asp Pro Arg Arg lie His Ser Gin Asp 
5 10 

Al3 AsD Thr Cly Ser Ala Met Gin Arq 
20 25 

hrg Arg Ser Gin Pro Phe Ser Val Giv 

33 40 

Glu Asa Gin Pro Gly Lys Leu Ser Trp 
55 

His Arg lie Cys Asp Leu 
70 



PCTAB98/0I231 

-5 1 

Asp Val Leu Arg Ser Ser Aia 
15 

Arq Giu Aia TrD Aia Giv Trp 
3C 

Leu Pro Ser Ala Giu Arq Leu 
45 50 

Arg Ser Leu Val Gly Glu Gly 
60 65 



(2) IMFORMATION FOR SEQ ID NO: 4 62: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino aci<Ss 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -53.. -1 

(C> IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORMATION: score 7.1 

sec IAVGLGVA.ALAFA/GR 

SEQ ID NO: 4 62: 



Pro Val Gly Glu Ser Leu Arg Tyr 
-45 -40 

Lys Arg Pro Asp Aia Asp Val Asp 
-25 

lie Aia Val Gly Leu Gly Val Aia 
-10 

Aia Phe Arg lie Trp Lys Pro Leu 

5 10 

Lys Lys lie Scr Thr Pro Ser Phe 
20 25 

Giu Gin Lys Met Ser Arg Arg Giu 
40 

r'tu Ser Aia Gly Lys Aia Lys He 



(xi) SEQUENCE DESCRIPTION: 



Met Ala Ala Arg Gly Val He Ala 
-50 

Ala Glu Tyr Leu Cln Pro Ser Ala 
'35 -30 

Gin Gin Arg Leu Val Arg Scr Leu 
-20 -15 

Ala Leu Ala Phe Ala Gly Arg Tyr 
-5 1 

3iu Gin Val lie Thr Giu Thr Aia 
15 

J^r Ser Tyr Tyr Lys Giy Gly Phe 

30 35 

a:-: Gly Leu Giv Val Sei 
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45 50 55 

Arg Thr Ala His Arg Arg Vai Met lie 
60 55 



(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 4 3 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A} NAME/KEY: sig^peptide 

(B) LOCATION: -15..-! 

(C) IDENTIFICATION METHOD: Von Keijne matrix 

(D) OTHER INFORMATION: score 7.1 

seq KLKLLSLLRP3LC/IP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 



Met lie Lys Leu Lys Leu Leu Ser Leu Leu Arg Pro Ser Leu Cys lie 
-15 -10 -5 1 

Pro Gin Leu Leu Arg Thr Asn Ala Thr Leu Leu Phe Thr lie Ala Ser 

5 10 15 

Cvs Asn Leu Gin lie Pro Ala Ser Pro Arg Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Koro Sapiens 
(F) TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -20 . . -1 

(C) IDENTIFICATION METHOD: Von Helens matrix 
{D) OTHER INFORMATION: score 6.5 

seq GLCVLQLTTAVTS/AF 
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(xi J -SEQUENCE DESCRIPTIOS : ID MC : 



Met Pro Ser Val Asn Ser Ala Giy leu Cys Val Lej Gin Leu Thr Thr 

-20 . -15 -10 -5 

Ala Vai Thr Ser Ala ?he Leu Leu Ala Lys Val Asn Pro ?:-\e Glu Xaa 
: 5 10 

Phe Leu Ser Arg Giy ?he Trp Leu Cys Ala 
15 20 



:2) INFORMATION FOR SEQ ID NO: 4 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
.;B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: PROTEIN 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{:) TISSUE TYPE: Ovary 



(ixl FEATURE: 

(A) NAME/KEY: sig_pepticie 
(3) LOCATION: -20.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.4 

seq ALFLLVSXYMIRS/GT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 65: 



Met Met Leu Giy Leu His Phe Ala Leu Phe Leu Leu Val Ser Xaa Tyr 
-20 -15 -10 -3 

Met lie Arg Ser Giy Thr Giy Asn Lys lie Glu Glu Giy Giy Arg 
15 10 



(21 INFORMATION FOR SEQ ID NO: 4 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 amino acids 
(3) TYPE: AMINO ACID 
{Dl TOPOLOGY: LINEAR 

;ii) MOLECULE TYPE: PROTEIN 

;vi) C5.IGINAL SOURCE: 

{.-.} ORGANISM: Honio Sapiens 
{?) TISSUE TYPE: Testis 

iiy.) FEATURE: 

(A) NAaME/KEY: sig^peptide 
•3) LOCATION: -12.. I 
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-■ (CI IDENTIFICATION METHOC: Von Heij-e matrix. 
{0} OTHEH INFORM=iT:CN: score 6.4 

seq KALLLSVLRV'^[.G/r,F 

INI) SEQUENCE DESCRIPTION: SEQ ID NO: 466; 



Met Ala Leu Leu Leu Ser Val Leu Arg Vai Leu Leu Glv Giy ?he Phe 

-ic .5 * 1 

Aia Leu Val Gly Leu Ala Lys Leu Ser Glu Glu He Ser Ala ?ro Vai 

5 10 15 

Ser Glu Arg Met Asn Ala Leu Phe Val Xaa Phe Ala Glu Val Leu Gly 
20 25 30 35 



(2) INEORM.21TION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 
3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(?) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -17. .-1 

(C) IDENTIFICATION METHOD: Von Hei^ne matrix 
(Dl OTHER INFORMATION: score 6.2 

seq LWLSLVAWKWGEA/VL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 



Met Leu Lys Ser Leu Trp Leu Ser Leu Vai Ala Trp His Trp Gly Giu 
-15 -10 -5 

Aia Val Leu Leu Ser Pro His Leu Pro Aia Ala Aia Giu Trp Pro Arg 
1 5 10 15 

Aia Aia Cys Asp Ser Gly Gly Glu Pro 
20 



(2) INFORMATION FOR SEQ ID NO: 4 68: 

(i) SiyJENCE CHARACTERISTICS: 

:A) LENGTH: 24 arr.ino acids 
■2: TYPE: AMINO ACID 
..Z) TOPOLOGY: LISE.^R 

•li;- i'iClECULE TYPE: PROTEIN 
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(vi) .ORIGINAL SOURCE: 

(A) ORGANISM: Hone Sapiens 
(F) TISSUE TYPE: Ovary 

(iX; FEATURE: 

(A) NAME /KEY: si:7_pepwide 

(B) LOCATION: -15. .-i 

{C} IDENTIFICATION METHOD: Voa Heijne natrix 
(D) OTHER INFORM.:iTrON: score 6.1 

seq IVTWLLXSFMSSA/EE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 



Met Gly He Val Thr Trp Leu Leu Xaa Ser Phe Met Ser Ser Ala Giu 
-15 -10 -5 

Giu Ser Vai Ser Ala Arg Thr Arg 
5 



(2) INFORMATION FOR SEQ ID NO: 469: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(r) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION: -237. -1 

(CJ IDENTIFICATION METHOD: Von Heijne matrix 
(D) OTHER INFORf-iATION: score 6 

seq IGLMFLMLGCALP/IY 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 



Met Ala Gly He Lys Ala Leu He Ser Leu Ser Phe Gly Gly Aid He 
-25 -20 -15 

Giy Leu Met Phe Leu Met Leu Gly Cys Ala Leu Pro He Tyr Asn Lys 
-10 -5 1 

Tyr Trp Pro Pro Gly 
5 



(2) INFORMATION FOR SEQ ID NO: 470: 

(i* SEQUENCE CHARACTEP.ISTIC3 : 

(A) LENGTH: 3-: ariinc acids 
(3) TYPE: AMI MO ACID 
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(ii) MOLECULE TYPE: PROTEIM 

(VI) ORIGr.^AL SOURCS: 

(A) ORGAMISM: Homo Sapiens 
:?) TISS'JE TYPE: Ovary 

(ix) FEATURE : 

(A) NAME/KEY: siq_peptide 
(3) LOCATION: -2b. .-I 

(C) IDENTiriCATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score S.8 

seq VKLVTLSVPTSLA/SS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 



Met Lys Lvs Gin Lys His Gin Lys Leu Trp Cys lie Ser Vai Lys Leu 
-25 ' -2C -15 

Val Thr Leu Ser Vai Pro Thr Ser Leu Ala Ser Ser Leu Thr Ser Pro 
-10 -5 1 5 

Thr Giy 



(2) INFORMATION FOR SEQ ID NO: 471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
{F) TISSUE TYPE: Testis 

(i'x) FEATURE: 

(A) NAME/KEY: ijiq_peptide 
(3) LOCATION: -40. .-1 

(C) IDENTIFICATION METHOD: Von heijne matrix 

(D) OTHER IMFORKATION: s=ore 5.3 

seq VLFALFVAFLLRG/KL 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 471: 



Met Asp Giy He Pro Met Ssr Met Lys .Asn Glu Met Pro lie Ser Gin 
-40 -35 -30 -25 

Leu Lou Hcz llz lie Aia Pro Sor Leu Giy The Val Leu Phe Ala Leu 

-20 -15 -iC 

^he Vai .-.la Pha Leu Leu .-.rj Giy Lys Leu Met Clu Thr Tyr Cys Ger 

-5 1 5 



Gin Lys His Thr Arg Leu A?? Tyr lie Giy As? Scr Lys Asn Val Leu 
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10 , lb 20 



Asn Asp Vai Gin His Cly Arg Giu Asp Asp Gly His Giy 

25 3C 35 



(2) INFORM^lTION rOR SZQ ID MC: 472: 



(i) S£QU£MC£ CHARACTERISTICS: 

(A) LENGTH: 91 amine acids 
(3) TYPE: AMINO ACID 
(0) TOPOLOGY: LIN'EAR 



(ii) MOLECULE TYPE: PROTEIM 

(vi) CRIGIMAL SOURCE: 

(A) ORGAMISM: Homo Sapiens 
(r) TISSUE TYPE: Uterus 



(ix) FEATURE: 

(A) NAME/KEY: sig_pep-ide 

(B) LOCATION: -57 . . -i 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
{D) OTHER INFORMATION: score 5.7 

seq LAICSCLPGPGPA/LP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 



Met Giy Giv Phe Leu His Leu Pro Ala Leu Ser Ser Ser Cys Leu Trp 
-55 -50 -45 

Thr Phe Pro Pro Met Cys Val Arg lie Phe Ser Tyr Val Pro Leu Pro 
-40 ' -35 -30 

He Leu Thr Pro Lys Thr lie Asn Leu lie Pro Vai Leu Ala lie Cys 
-25 -20 -15 -10 

Ser Cys Leu Pro Gly Pro Gly Pro Ala Leu Pro Leu Pro Ala Phe Pro 
-5 1 5 

Thr Leu Leu Val Ser Trp Tyr His Cys Pro Pro Gin Lys Lys Thr Gly 

10 15 20 



Met Met Aso Thr Asp Asp Phe Arg Ala Cys Pro 
25 ' "30 



(2) INFORMATION FOR SZQ ID MO: 473: 



(i) SEQUENCE CHAR-^CTERISTICS : 

(Aj LENGTH: 51 amino acids 
(3i TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii; MOLECULE TYPE: PP.07EIN 

(vi) ORIGINAL SOURCE: 

(Ai ORGAN 1 5::: Hcnio Sapians 
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■ :c) TISSUE type:: Taszis 

(ix; r£.^TURE: 

•A) NAME/KEY: sig peptics 
(B> LOCATION: -197* -I 

(C) IDENTinCATION METHCO: Von -:ei-)ne macrix 
iD) OTHER INFORMATION: score 9 

seq WGFLOVL=TAVHP/AP 

ixi) SEQUENCE DESCRIPTION: 3EQ ID NO: 473: 



Met Ser Pro Ser Pro Arg Trp Gly ?he Leu Cvs Val Leu ?he Thr Ala 
-15 -10 " -5 

Val His Pre Ala Pro Ser Thr Ala Pro Val Gin Asp Lys Cys Pro Val 
15 13 

Asn Thr Trp Glu Ala Met Gin Ala Ser Ser Gin Glr. Leu Lea Gin Thr 
15 20 25 

Asp Pro Met 
30 



(2) INEORRATION FOR SEQ ID NO: 474 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptice 
(3) LOCATION: -76. .-1 

{C) IDENTIFICATION METHOD; Von Heijne matrix 
(D) OTHER INFORMATION: score 5.3 

seq IILASASFSPNFT/QV 

(y.i) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 



Met Thr Ser Gin Pro Val Pro Asn Glu Thr tie lie Val Leu Pro Ser 

-75 -70 -65 

Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 

-60 -55 -50 -45 

Gly Gl.^. Aso Ser Leu Ly.s Lys rlis Leu His Ala Glu Xaa Lys Val lie 

-40 * -35 -30 



Gly Thr lie Gin lie Leu Cys Gly Mec Me: Val leu Ser Leu Gly lie 



-25 



-20 



-:5 



wo 99/06549 



361 



PCT/IB98>01231 



Ii9 Leu .Ma Ser Ala Ser ?he Ser Pro Asn Phe Thr Gin Val Thr 3e: 

-i: -5 1 

Thr Leu Lei: Asn Ser Ala Tyr Pro Phe lie Gly Pro Gly 

5 10 * 15 



(2) IMfCRMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

•A) LENGTH: 127 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE; PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

{A} NAME/KEY: sig_peptide 

(B) LOCATION: -9lT.-i 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
(01 OTHER INFORMATION: score 4.8 

seq :ILRLPWLNRSQT/VV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 



Met Arg Ala Leu Giu Asn Asp Phe Phe Asn Ser Pro Pro Arg Lys Thr 
-90 -8*5 -80 

Val Arg Phe Gly Gly Thr Val Thr Glu Val Leu Leu Lys Tyr Lys Lys 
-75 -70 -65 -60 

Gly Glu Thr Asn Asp Phe Glu Leu Leu Lys Asn Gin Leu Leu Asp Pro 
-55 -50 -4*5 

Asp lie Lys Asp Asp Gin He He Asn Trp Leu Leu Glu Phe Arg Ser 
-40 -35 -30 

Ser Val Met Tyr Leu Thr Lys Asp Phe Giu Gin leu lie Ser lie He 

-25 -20 -15 

Leu Arg Leu Pro Trp Leu Asn Arg Ser Gin Thr Val Val Glu Glu Tyr 
-iO -5 I 5 

Leu Ala Phe Leu Gly Asn Leu Val Ser Ala Giu Thr Val Phe Leu Arg 

10 15 20 

Pro Cys Leu Ser Mer lie Ala 5sr His Phe Xaa Pro Pro Glu Leu 
25 30 35 



(2) ZNTDRM^TION FOR SZQ ID NO: 476: 

U; SEQUENCE CHAPACTSRISTICS : 

;A) LENGTH: 57 anino acids 
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(3) TVPS: AMINO ACID 
(D; TOPOLOGY: LINEAR 

(ii) C^OLECULZ TYPE: PROTEIN 

(v'l) ORIGIMAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleea 

(IX; FEATURE: 

(A) NAME /KEY: sig^peptide 

(B) LOCATION: -31..-1 

(C) IDENTIFICATION METHOD: Vcn Heijne matrix. 

(D) OTHER INFORMATION: score 4.8 

seq WAFSCGTWLPSRA/EW 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476: 



Met Vai Phe Pro Ala Lys Arg Phe Cys Leu Vai Pro Ser Met Glu Giy 
-30 -25 -20 

Vai Arg Tro Ala Phs Ser Cys Giy Thr Tr? Leu Pro Ser Arg Ala Glu 
-13 ' -10 -5 1 

Trp Leu Leu Ala Vai Arg Ser lie Gin Pro Glu Glu Lys Glu Arg lie 
5 10 15 

Giy Gin Phe Vai Phe Ala Arg Asp Giy 
20 25 



(2; INFORM.J\TION FOR SEQ ID NO: 477 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

•vi) ORIGINAL SOURCE; 

(A) ORGANISM: Homo Sapiens 
(Fl TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(8) LOCATION: -82. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.7 

seq LTCLADLFH3IAT/XK 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 



Met Asn Cys Phe Gla Giv Thr Asn Ala Ser Ala Leu Glu Lys Asp lie 
-30 ' -75 -7C 



Glv Pro Glu Gin Phe ?ro lie Asn Glu His Tyr Phe Giy Leu Vsl Asn 
-65 -60 -55 
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Ph5 Giv Asn Thr Cys Tyr Cys ksn Ser Val Leu Zin Ala Leu lyr Ser 
-5C ' -45 * -40 -3S 

Cys Arc Pro ?ne Arg Giu Asn Val Leu Ala Tyr Lys Ala Gin Gia Lys 
-3C -25 -20 

Lv5 Lv3 GIu Asn Leu Leu Thr Cys Leu Aia Asp Leu Phe His Ser lie 
- ' -15 -5 

A' a Thr Xaa Lys Lys Lvs Val Xaa Ser Ser His Leu Gly 
1 " 5 10 



(2) INFORMATICN FOR SEQ ID NO: 473: 

(i) SEQUENCE CHAEIACTERISTICS : 

(A) LENGTH; 55 amino acids 
(3) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

{ii^ MOLECULE TYPE: PROTEIN 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -16.. -I 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6 

seq ALRVRXXXFGTRA/CR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 



v-f Ala Ma Ala Leu Arg Val Arg Xaa Xaa Xaa Phe Gly Thr Arg Aia 
-15 -10 -5 

Cvs Arg Arg His Gly Leu Pro His Arg Aia Xaa Tr? Leu Arg Asn Arg 
'l 5 10 15 

Val Xaa Asp Arg Tyr Phe Arg lie Gin Giu vai Leu Lys Xaa Ala Arg 
20 25 30 

His Phe Arg Gly Arg Lys Arg 
35 



(2) INFORMATION FOR SEQ ID NO: 479: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 amino acids 

(B) TY?I: AilINO ACID 
(D) TOPOLOGY: LINEAR 

(L::i MOLECULE TYPE: PROTEIN 
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(VI) .CRIGIMAl SO-JRCI: 

(A) ORGAMISM: Homo Sapiens 
(D TISSUE TYPE: Testis 

(iKi FEATURE : 

(A) NAME/KEY: siq_peptide 

(B) LOCATION: -15.. -1 

(C) rDENTinCATION METHOD: Von Heijr.e' matrix 

(D) OTHER INFORMATION: score 4.6 

seq LLTHNLLSSHVRG/VG 

ixi) SEQUENCE DESCRIPTICN: SEQ ID NO: 479: 



Met Lys Leu Leu Thr His Asn Leu Leu Sar Ser His Val Arg Gly Val 

-15 -10 -5 1 

Gly Ser Arg Giy Phe ?ro Leu Arg Leu Gin Ala Thr Giu Val Arg lie 

5 10 13 

Cys Pro Vai Giu Phe Asn Pro Asn Phe Val Ala Arg Arg 

20 25 30 



(2) INFORMATION FOR SEQ ID NO: 480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAuME/KEY: sig^peptide 

(B) LOCATION: -41.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6 

seq VSAGSLLLPAPQA/EX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 80: 

M-t Giy Xaa Phe Ser Arg Arg Thr Phe Cya Giy Arg Ser Giy Arg Ser 
-40 -35 -30 

CV3 Arg Gly Gin Leu Val Gin Val Ser Arg Pro Giu Val Ser Ala Giy 
-25 ' -20 -15 

Leu Leu Pro .Ma Pro Gin Aia Giu Xaa His Ser Ser Xaa Xaa 

-ri 1 5 

Tyr Pro Arg Pro Lvs Scr Leu Leu Pro Lys Met Gly 
10 15 20 
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(2: I^JFORtC^TIOM FOR SZQ ID NO: 42 1: 

ii) SEQUiNCi CHARACT£RI5T:C3: 

(A) LENGTH: 79 ammo acics 
(3) TY?E: P^.UIO ACID 
(D) TOPOLOGY: LINEAR 

■ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapians 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_pe?tide 
(3) LOCATION: -55. .-1 

(C) IDENTIFICATIOM METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6 

seq CALSLPDAPGASG/GR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 



Met Glu Giy Gly Val Arg Leu Asp 
-55 -50 

Giy Val Ala Vai 3er Glu Leu Pro 
-35 

Pro Glu Gly His Gly Pro Gly Leu 
-20 

Aso Ala Pro Gly Ala Ser Gly Gly 
-5 1 

Giv Gin Gin His Thr Gly Leu Pro 
10 15 



Leu Ser Ala Cys Gly Glu Thr Ser 
-45 -40 

Ala Ser Glu Thr Ala Ala Leu Val 
-30 -25 

Arq Ala Cys Ala Leu Ser Leu Pro 
-15 -10 

Arg His His Leu lie Leu val Pro 
5 

Ala Ser His val His Pro Gin 
20 



(2) INFORMATION FOR SEQ ID NO: 432: 

(i) SEQCENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(iK) FE-MURE: 

( A ) NAME / KE Y : s i g_pept ide 
fz) location:: -147. -1 

iC; IDEMTXFICATION METHOD: Vop. Heijne rr.atrix 
(D; OTHER INFORMATION: score 4.5 

seq TLLSFAALT.--AFS/VL 
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(xi) - SEQUENCE DESCRIPTION: SEQ ID NO: ^82: 



Met Tar Leu Leu Ser ?he Aia Aia Leu Tnr Ala Ala Phe Ser Vai Leu 
•-10 -5 1 

Pro Lys 



(2) INFORMATION FOR SEQ ID NO: 433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 8 amino acids 
(3) TYPE: AMINO ACID 
{D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(Fl TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -21. .-1 

(C) IDENTIFICATION METHOD: Von Heijp.e matrix 
(0) OTHER INFORMATION: score 4.4 

seq GLSKLQFAPFSSA/LD 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 



Met Aia Ala Ala Thr Gly Asp Pro Giy Leu Ser Lys Leu Gin Phe Ala 
-20 -15 -10 

Pro Phe Ser Ser Ala Leu Asp Vai Giy Phe Trp His Glu Leu Thr Gin 
-5 1 5 10 

Lvs Lys Leu Asn Giu Tvr Arg Leu Asp Glu Aia Pro Lys Asp lie Lys 
15 " 20 25 

Giy Tyr Tyr Tvr Asn Gly Asp Ser Aia Gly Xaa Pro Aia Arg Leu Thr 
30 * 35 40 

Leu Giu Phe Ser Ala Phe Asp Met Ser Ala Pro Thr Pro Ser 
45 50 55 



(2) INF0RM2,TI0N FOR SEQ ID NO: 484: 

(i) SEQUENCE CHARACTERISTICS: 

{A> LENGTH: 32 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

lii) :':OLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

:A) 0RGA::I3M: Homo Sapiens 
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i?) tissue: TYPE: Spleen 

(IX) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -27 . . -1 

{Ct IDENTIFICATION METHOD: Von Hei]ne nacrix 
(D) OTHER INFORMATION: score 4.4 

seq L£KSLLLV?3XLS/LL 

(xi) SEQUENCE DESCRIPTION*: SEO 10 NO: 434: 



Met Phe Tr.r 3er ?hr Gly Ser Ser Gly Lea Tyr :*ys Ala Pro Leu Ser 

-25 -20 -15 

Lys Ser Leu Leu Leu Vai Pro Ser Xaa Leu Ser Leu Leu Xaa Aia Gin 

-10 -5 1 5 



(2) INFORMATION FOR SEQ ID NO: 4 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A} NAME/KEY: sig^peptide 
(3] LOCATION: -40.. -1 

(C) IDENTIFICATION METHOD: Von Heijne niatrjLx 

(D) OTHER INFORMATION: score 4.3 

seq ITLVSAAPGKVIC/Ei'I 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 485: 



Met Thr Ser Met Thr Gin Ser Leu Arg GIu Vai lie Lys Aia Met Thr 
-40 -35 -30 -25 

Lvs Ala Arg Asn Phe Glu Arg Vai Leu Giy Lys lie Thr Leu Vai Ser 
-20 -15 -10 

Ala Aia Pro Giy Lys Vai lie Cys Glu Met Lys Vai Glu Giu Glu His 

-5 i 

Thr Asn Ala lie Giy Thr Leu His Giy Giy Leu Thr Aia Thr Leu Vai 
10 15 20 

i.sp Asn He 3^r Tar ^!et Aia Leu Leu Cys Thr Giu Arg Giy Aia Pro 
25 30 35 40 

Giy Val Ser 
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(2) INFORMATIOW TOR SZQ ID NO: 4S6: 

(I) SIQUEMCI CHARACTERISTICS: 

(A) LENGTH: 8 6 amino acids 

(B) TYPE: AMINO ACID 
(j; TOCOLOGY: LINEAR 

(II) v.OLECjLE TYPE: PROTEIN 

(VI) CRIGIMAL SOURCE: 

(A) ORGANISM : Homo Sapiens 
(?) TISSUE TYPE: Spleen 

(ix) FEATURE: 

{A) KAME/KEY: sig^peptide 
(3) LOCATION: -73.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
ic) OTHER INFORMATION: score 4.3 

seq DIILSGLVPGST7/LH 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486: 

Met Ala Asp Phe Glv He Ser Ala Gly Gin Phe Val Ala Val Val Trp 
-70 ■ -65 -60 , 

Aso 'vs Se- Ser Pro Val Glu Ala Leu Lys Gly Leu Vai Asp Lys Leu 
-55 -50 -'15 

Gin Ala Leu Thr Gly Asn Glu Gly Arg Val Ser Val Glu Asn He Lys 
-40 -35 -30 

G^n Leu Leu Gin Ser Ala His Lys Glu Ser Ser Xaa Asp He lie Leu 
-25 -20 -15 

Ser Gly Leu Val Pro Gly Ser Thr Thr Leu His Ser Ala Glu He Leu 



A-fl Gill Tls AM Arg Val 
10 



(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

:A) ORGANISM: Homo Sapiens 
•Fi TISSUE TYPE: Ovary 

(ix) rr.ATURE: 

..-.) NAME/KEY: sig_peptide 
■?.) LOCATION: -I4T.-I 
:» :dEST:FTCATI0N method: Von Heijne ff;atrix 
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.Ji OTrilR I^rORMATION': score 4.2 

seq GILLGLLLLGHLT/VR 

{xi) SEQUENCE DESCKIPTION: SEQ ID NO: 487: 

M«c Glv I'e Leu Leu Glv Leu Leu Leu Leu Gly His Lej Thr Vai Arc 
' -10 ' -5 1 



(2) INFCRM^.TION FOR SEQ 10 NO: 488: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 arr.ino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY; LINEAR 

(ii; MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
iT) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -19.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 
O) OTHER INFORMATION: score 4.1 

seq LLLGQRCSLKVSG/QE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 



Met Ph*> Leu T.-^r Vai Lys Leu Leu Leu Gly Gin Arg Cys Ser Leu Lys 
.15 -10 -5 

Vai S-^r G^v G'n Giu Ser Vai Ala Thr Leu Lys Arg Leu Vai Ser Arg 

1 5 10 

Arg Leu Lys Vai Pro Giu Giu Gin Gin His Leu Leu ?he Arg Giy Gin 
15 ' 20 25 

Leu Leu Giu Asd Aso Lvs His Leu Ser Asp Tyr Cys He Gly Pro Asn 
30 ' * 35 40 45 

Ala Se'- lia Asn Vai He Met Gin Pro Leu Giu Lys Met Ala Leu Lys 
50 55 60 

Giu A^a His Gin Pro Gin Thr Gin Pro Leu Trp His Gin Leu Giy Leu 

65 70 75 

Vai Leu A^5 Lvs His ?he Giu Pro Gin Asp Ala Lys Ala Vai Leu Gin 
3^. * 85 50 

Leu Leu Arr Giu His Giu Giu Arg Leu Gin Lys lie Ser Leu Giu 
95 100 105 

^is Leu Leu Ala Gin Tvr Leu Leu Ala Giu Giu Leu Thr Trp 

110 " 115 ' 120 125 
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{2) INFORt^L^TIOK FOR SZQ ID MO: 489: 

(i) SECCJENCH: CHARACTERISTICS: 

{A) LENGTH: ICS amino acids 
iS) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(yi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -32,. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.1 

seq RLLSSLLLTMSNN/NP 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 



Met Asn Vai lie Asp His Val Arg Asp Met Ala Ala Ala Gly Leu His 
-30 -25 -20 

Ser Asn Vai Arg Leu Leu Ser Ser Leu Leu Leu Thr Met Ser Asn Asn 
-15 -10 -5 

Asn Pro Glu Leu Phe Ser Pro Pro Gin Lys Tyr Gin Leu Leu Val Tyr 
15 10 15 

His Ala Aso Ser Leu Phe His Asp Lys Glu Tyr Arg Asn Ala Val Ser 
20 25 30 

Lys Tyr Thr Met Aia Leu Gin Gin Lys Lys Ala Leu Ser Lys Thr Ser 
35 40 45 

Lys Val Arg Pro Ser Thr Gly Asn Ser Aia Ser Thr Pro Gin Ser Gin 
50 " 55 60 

Cys Leu Pro Ser Glu lie Glu Vai Lys Tyr 
65 70 



{2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 59 amino acids 
(B) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(ii) ;'!OLECULE TYPE: PROTEIN 

(vi; ORIGINAL SOURCE: 

(A) ORGANISM: Komo Sapiens 
(F) TISS'JE TYPE: Tescis 
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(ix; - FEATURE: 

{ A ) NAME /KEY: s iq_pep c ide 
:B) LOCATION: -41 . . -I 

:C) IDENTIFICATION METHOD: Von Heijne niacrix 
{D) OTHE^ INFORMATION: score 4.i 

«sq RVt.CPl.r.X.iKA.AA?/KR 

(Ki) SEQUEN'CE DESCRIPTION: SEQ ID NO: 490: 



Met Giy Thr Pro Ser Leu Ser lie leu Leu lie Giy Aia Pro Giu Ser 
-40 -35 -30 

Pro lie Pro Tyr Phe Pro Tyr His Ser Giy Thr Giy Arg Vai Leu Cys 
-25 -20 -i5 -10 

Pro Leu Leu Xaa Ala Aia Ala Aia Pro Lys Arg Asp Val Pro Glu Thr 
-5 1 5 

Giy Leu Thr Arg Gin Leu Lys Arg His Pro Giy 

10 15 



(2) INFORMSiTICN FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIM 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -26.. -i 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.9 

sec HALFVLCLLYAMS/HN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 91: 



Met Val Tyr His Ala Leu Asp Ser Pro As? Asp Asp Tyr His Ala Leu 
-25 -20 -15 

Phe Val Leu Cys Leu Leu Tyr A.ia Met Ser His Asn Lys Giy Met Asp 
-10 -5 : 5 

Pro Glu Lvs Leu Glu Arg lie Gin Leu Pro Vai Pro Asn Ala Ala Glu 

10 15 20 

Lys Thr I'r.r Tvr Asn His Pro His Giy 

30 
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{2; INrORMATIO;-! FOR SEQ ID NC : 4 92: 

(i) SECUHTNCE CHARACTERISTICS: 

(A) LENGTH: 24 ammo acids 
(3) TYPE: At^IMO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Spleen 

(ix) FEATURE: 

(A) NAME/KEY: siq^peptide 
(3) LOCATION: -14.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.8 

seq FIVLSMWLCCGFE/IL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 92: 



Met Phe He Val Leu Ser Met Trp Leu Cys Cys Gly ?he Glu He Leu 
-10 -5 1 



Gin Thr Lys Ssr Trp Val Ala Gly 
5 10 



(2) INFORMATION FOR SEQ 10 NO: 4 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(r) TISSUE TYPE: Spleen 

(i;.;} FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -13. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.8 

seq V\/-ILS3XVPLAAM/GV 

(/.i) SEQUENCE DESCRIPTION: SEQ ID NO: 4 93: 



Mtji; vii V.^i Val He Leu Ser Ser Xaa Vai Pro Leu Ala Ala Met Gly 
-15 -10 -5 1 

Va^ Xst Gly Cys Vai .Arg Val Tip 
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(2) IMTORMATIOM FOR SEQ ID NO: 4 94: 

-.1) St:QUE>^CE CHARACTERISTICS: 

(A; LENGTH: 67 amino acids 
(3; TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

Ui) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(ix) FEATURE: 

;A) NAME/KEY: sig_peptide 
;B) LOCATION: -15.. -1 

;C) IDENTIFICATION METHOD: Von Hei^ne matrix 
(D) OTHER INFORMATION: score 3,8 

seq AECSSLLHP3VRG/SI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494: 



Met Leu Aia Giu Cys Ser Ser Leu Leu His Pro Ser Vai Arg Giy Ser 
-15 -10 -5 1 

Il«* Pro Giu Ala Thr Cys Arg Val Leu Pro Cys Giy Pro Leu His Asn 
5 10 15 

Met Ala Val Cys Ser Cys Lys Ala Ser Arg Ser Phe Tyr Cys Asn Phe 
20 25 30 

Arg Ser Leu Arg Leu Ala Val Ser Asp Phe Leu lie Leu Phe Gin Lys 
35 40 45 

Giy Leu Giy 
50 



(2) IMFOP>!ATION FOR SEQ ID NO: 495: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(D TISSUE TYPE: Ovary 

(i:<) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: -IST.-I 

\C) IDENTIFICATION METHOD: Vor. Heijtie matrix 
(D) OTHER INFORMATION: score 3.3 

seq ^!ARLL.GLCAWARK/SV 
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(XI), SEQUENCE DESCRIPTION: SEQ ID NO: 495: 

Met Gin Met Ala Arg Leu Leu Gly Leu Cys Ala Trp Ala Arg Lys Ser 
.15 -10 -5 i 

Val Arg Met Ala Ser Ser Arg Met Thr Arc Arg Asp Pro Pro Arc 
5 10 



(2) INFORMATION FOR SEQ ID NO: 4 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Uterus 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 

(B) LOCATION: -43.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.8 

aeq LISVLYLIPKTLT/TN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 96: 



Met Thr Pro Gin Tyr Leu Pro His Gly Gly Lys Tyr Gin Val Leu Gly 
-40 -35 

Aso Tyr Se- Leu Ala Val Val Phe Pro Leu His Phe Ser Asp Leu lie 
-25 -20 -15 

Ser Val Leu Tyr Leu He Pro Lys Thr Leu Tar Thr Asn Ser Arg 
-10' -5 1 



(2) INFORMATION FOR SSQ ID NO: 4 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 
(3) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(li) MOLECULE TYPE: PROTEIN 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

[LA) FEATURE: 

(A) NAME/KEY: sig^peptide 
(3) LCCATION: -23.. -1 
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(C) IDENTIFICATION METHOD: Von Heijr.9 matrix 
IV;) OTHER INTORMATION: score 3.7 

seq FLPFLXRArACRG/CQ 

(xi) SEQ'JEN'CE DESCRIPTION: SEQ ID NO: 4 97: 

Met val vai Leu Arg Aia Gly Lys Lys Thr Phe Leu Pro Pro Leu Xaa 
-20 

Ara Aia Phe Aia Cys Arg Gly Cys Gla Leu Ala Pro Glu Arg Gly Aia 

G' u A-g Arg Asp Thr Ala Pro Ser Gly Val Ser Arg Phe Cys Pro Pro 
To 15 20 25 

Arg Lys Ser Cys His Asp Trp lie Gly Pro Pro Asp Lys Tyr Ser Asn 
30 35 

Leu Arg Pro Val His Phe Tyr lie Pro Glu Asn Glu Ser Pro Leu Glu 
45 50 55 

Gin Lvs Leu Arg Lys Leu Arg Gin Giu Thr Gla Glu Trp Asn Gin Gin 
50 65 70 

Phe Trp Aia Asn Gin Asn Leu Thr Phe Ser Lys Giu Lys Glu Giu Phe 
75 80 85 

lie His Ser 
90 



(2) INFORMATION FOR SEQ ID NO: 498: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sigjeptide 

(B) LOCATION: -20.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.6 

seq AHLCSDSLPESQQ/QD 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 498: 

Lys Arg Giu Gly Gly Aia Ala His Leu Cys Ser Asp Ser Leu Pro 
-lo -10 -5 

Glu -Ser 31r. Gin Gin Asp Gly Asn His Aia Pro Asn Phe Ser Ser His 
1 5 10 
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(2) :NT0?^L3^TI0N for" SSQ id NO: 499: 

(i: 3£CUENC£ CHARACTERISTICS: 

:A) LENGTH: 4 6 amino acids 
{B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

iiK) FEATURE: 

(A) NAME /KEY: sig_pepcide 
(3) LOCATION: -41. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.6 

ssq PYSLAACPCG3QG/GV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499: 

Met vai Thr Cys Pro Gly Pro Ser Ser Giy Gin Pro Leu Ser Ser Met 
-40 -35 -30 

r Thr Ala Gly Asp Arg Arg Gly Ala Pro Ser Leu Pro Tyr Ser Leu 



Ty 

-25 -20 



15 -10 



Ala Ala Cvs Pro Cys Gly Ser Gin Gly Gly Val Cys Met Arg 
-5 1 ^ 



INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Spleen 

ii-A) FEATURE: 

(A) NAME/KEY: sigjpeptide 
(3) LOCATION: -18.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq ALEVIVTLSET.AA/AM 

SEQUENCE DESCRIPTION: SEQ ID NO: 500: 
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M^t G'f Arq Gin Leu Ala Leu Glu Val lie Val T^.r Leu 5er 31u Tr.r 
-15 -10 -5 

Ala Ala Ala Met Leu Arg Lys His Thr Asn lie Val Ala Gin Thr lie 
1 5 10 

Pro Gin Me- Leu Ala Mer Met Val Asp Leu Glu Glu Asp Glu Asp Trp 
15 20 25 30 

Ala Asn Aia As? Glu Leu Glu Aso As? Asp Phe Asp Ser Asn Ala Val 
35 <0 ^5 

Ala Glv Glu Ser Aia Leu Asp Arg Met Ala Cys Gly Leu Giy Giy Lys 
50 55 60 

Leu Val Leu Pro Met lie Lvs Glu His lie Met Gin Met Leu Gin Asn 
65 70 75 

Arg Lys Leu Cys Pro Ser Met Leu 
80 85 



(2) INFORMATION FOR SEQ ID NO: 501: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: sig^peptide 
(B> LOCATION: -76.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LASASELPLGSRP/AP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 



M,et Giy Aso Tyr Leu Leu Arg Gly 
-75 ' -70 

Aia Asp Cys Gly Thr lie Leu Leu 
-60 -55 

Cys Val Ala Cys Gin Glu Leu Asp 
-40 

Ala Leu Asn Ala Gin Aia Ala Leu 
-25 

r.ia fier Aia Ser Glu Leu Pro Leu 
-10 -5 



Tyr Arg Mez Leu Gly Glu Thr Cys 
-65 

Gin Aso Lys Gin Arg Lys lie Tyr 
-50 -45 

Ser Aso Val Asp Lys Asp .Asn Pro 
-3*5 -30 

Ser Gin Aia Arg Glu His Gin Leu 
-20 -15 

Gly Ser Arg Pro Ala Pro Gin Pro 
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His Gly 



(2/ INrORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

■;A) LENGTH: 41 amino acids 
(3) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi} ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Ovary 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(3) LOCATION: -15. .-1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.1 

seq LLYLLVPALFCRA/GG 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 



Me- T-p Leu Leu Tyr Leu Leu Val Pro Ala Leu Phe Cys Arg Ala Gly 
^ll ' -10 -5 1 

G'v Ser He Pro He Pro Gin Lys Leu Phe Gly Glu Val Thr Ser Pro 

5 10 15 

Leu Phe Pro Lys Pro Tyr Pro Asn Thr 
20 25 



(2i INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 5 amino acids 

(B) TYPE: AMINO ACID 
(0) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Testis 

(ix) FEATURE: 

(A) NAME/KEY: 3ig_peptids 

(B) LOCATION: -58.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score A 

seq l;^VvSPLVRS'-.IS/SN 



(y.i) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 
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M=»t Lvs Leu Glu Phe Thr Glu Lys Asn Xaa Xaa Ser ?he Vai Leu Gin 

-55 -50 -45 

Asn Leu Asn Arg Gin Arg Lys Arg Lys Giu Tyr Tr? Asp Met Ala Leu 

-40 -35 -30 

Ser Va^' Asp Asn His Vai Phe Phe Ala His Arg Asn Vai Leu Ala Ala 

-25 -20 -15 

Vai Ser Pro Leu Vai Arg Ser Leu lie Ser Ser Asn Asp Met Lys Thr 

-10 -5 1 5 

Ala Asp Giu Leu Phe lie Thr lie Asp Thr Lys 

10 15 
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Fom PCT/ISA/21 0 (continuition of ftrtt sheet (1 )) (July 1 992) 



lnttfn«I»nalApplieatiooNo PCT/IB 98/01231 



FURTHER INFORMATION CONTINUEO FROM PCWSA/ 210 



1. Claims: Invention 1: 1-37 all partially 

Nucleic acid conprising sequence as in Seq. ID: 38, 
complementary sequence, fragments, hybridising sequences. 
Polypeptide comprising a signal peptide encoded by said 
nucleotide sequence. Vector encoding a fusion protein 
conprising said signal peptide. Method of directing the 
extracellular secretion of a polypeptide by means O; said 
vector. Method of inporting a polypeptide into a cell by 
means of said signal peptide. Method of making a cDNA 
encoding a secretory protein, partially encoded by said 
nucleotide sequence, corresponding cDNA. Polypeptide encoded 
by said nucleotide sequence, comprising sequence as in Seq. 
10:271, method of making said polypeptide. Method of 
obtaining a promoter located upstream of said nucleotide 
sequence, promoter thereof. 

2. Claims: Inventions 2-233: 1-37 all partially 

Idem as subject 1 but limited to each of the DNA sequences 
as in Seq. 10:39-270, and corresponding polypeptides, where 
invention 2 is limited to Seq.ID:39 and 272. invention 3 is 

limited to Seq. 10:40 and 273, invention 233 is 

limited to Seq. 10:270 and 503. 

For the sake of conciseness, the first subject matter is 
explicitly defined, the other subject matters are defined by 
analogy thereto. 
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